From b6723f02f8e90e25b195b9b35043b2150160acd0 Mon Sep 17 00:00:00 2001 From: toyota <toyota@cl.uni-heidelberg.de> Date: Fri, 30 Mar 2018 19:34:34 +0200 Subject: [PATCH] fix typos --- lib/README.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/lib/README.md b/lib/README.md index 4cf7703..c12d9d3 100644 --- a/lib/README.md +++ b/lib/README.md @@ -1,10 +1,10 @@ # CHERTOY - Creating language model with sent2vec -This is an implementation to provide necessary pre-processing steps for modeling an own sent2vec model which is used in the experiments. The two language models we built are a uni-gram and a bi-gram model over the wikipedia 2017 corpus. +This is an implementation to provide necessary preprocessing steps for modeling an own sent2vec model which is used in the experiments. The two language models we built are a uni-gram and a bi-gram model over the wikipedia 2017 corpus. ## RUNNING INSTRUCTIONS -## Pre-Processing Wikipedia Dump +## Preprocessing Wikipedia Dump Download Wikipedia Dump - Wikipedia Dumps for the english language is provided on https://meta.wikimedia.org/wiki/Data_dump_torrents#English_Wikipedia -- GitLab