Skip to content
Snippets Groups Projects
Commit 6bd4faf5 authored by friebolin's avatar friebolin
Browse files

Update links

parent fe709a4b
No related branches found
No related tags found
No related merge requests found
......@@ -89,7 +89,7 @@ As a comparatively safe (= label preserving) data augmentation strategy, we sele
- ✅ For each sentence, the top 5 paraphrases are kept, using [nucleus/topp](https://fairseq.readthedocs.io/en/latest/command_line_tools.html) as our sampling method, likewise for diversity reasons.
- 🔥 We test two versions: Generating paraphrases using a lower (0.8) and higher (1.2) **`temperature`**. This hyperparameter determines how *creative* the translation model becomes: higher `temperature` leads to more linguistic variety, lower `temperature` to results closer to the original sentence.
- 🔥 We test two versions: Generating paraphrases using a lower (0.8) and higher (1.2) `temperature`. This hyperparameter determines how *creative* the translation model becomes: higher `temperature` leads to more linguistic variety, lower `temperature` to results closer to the original sentence.
- 🌈 The diversity of the paraphrases is evaluated via the Longest Common Subsequence [(LCS)](https://docs.python.org/3/library/difflib.html#sequencematcher-objects) score in comparison to their respective original sentence.
......@@ -191,18 +191,18 @@ For `<COMMAND>` you must enter one of the commands you find in the list below, w
## 🏯 Code-Structure <a name="code-structure"></a>
- ⚙️ `requirements.txt`: All necessary modules to install.
- 📱 `main.py`: Our main code file which does ...
- 💻 `code`: Here, you can find all code files for our different models and data augmentation methods.
- 📀 `data`: Find all datasets in this folder.
- 🗂️ `original_datasets`: *Semeval_loc*, *Semeval_org*, *Relocar* in their original form.
- 🗂️ `backtranslation`: Contains unfiltered generated paraphrases.
- 🗂️ `paraphrases`: Contains only filtered paraphrases.
- 🗂️ `fused_datasets`: Contains original datasets fused with filtered paraphrases. Ready to be used for training the models.
- 📝 `documentation`: Contains our organizational data and visualizations.
- 🗂️ `organization`: Our research plan, presentation, final reports.
- 🗂️ `images`: Contains all relevant visualizations.
- 🗂️ `results`: Find tables of our results.
- ⚙️ [`requirements.txt`](requirements.txt): All necessary modules to install.
- 📱 [`main.py`](main.py): Our main code file which does ...
- 💻 [`Code`](code): Here, you can find all code files for our different models and data augmentation methods.
- 📀 [`data`](data): Find all datasets in this folder.
- 🗂️ [`backtranslations`](data/backtranslations): Contains unfiltered generated paraphrases.
- 🗂️ [`fused_datasets`](data/fused_datasets): Contains original datasets fused with filtered paraphrases. Ready to be used for training the models.
- 🗂️ [`original_datasets`](data/original_datasets): *Semeval_loc*, *Semeval_org*, *Relocar* in their original form.
- 🗂️ [`paraphrases`](data/paraphrases): Contains only filtered paraphrases.
- 📝 [`documentation`](documentation): Contains our organizational data and visualizations.
- 🗂️ [`images`](documentation/images): Contains all relevant visualizations.
- 🗂️ [`organization`](documentation/organization): Our research plan, presentation, final reports.
- 🗂️ [`results`](documentation/results): Find tables of our results.
***
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment