Skip to content
Snippets Groups Projects
Commit b6723f02 authored by toyota's avatar toyota
Browse files

fix typos

parent 54af8484
No related branches found
No related tags found
No related merge requests found
# CHERTOY - Creating language model with sent2vec
This is an implementation to provide necessary pre-processing steps for modeling an own sent2vec model which is used in the experiments. The two language models we built are a uni-gram and a bi-gram model over the wikipedia 2017 corpus.
This is an implementation to provide necessary preprocessing steps for modeling an own sent2vec model which is used in the experiments. The two language models we built are a uni-gram and a bi-gram model over the wikipedia 2017 corpus.
## RUNNING INSTRUCTIONS
## Pre-Processing Wikipedia Dump
## Preprocessing Wikipedia Dump
Download Wikipedia Dump
- Wikipedia Dumps for the english language is provided on https://meta.wikimedia.org/wiki/Data_dump_torrents#English_Wikipedia
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment