Commit f68009a2 authored by lkim

Add requirements.txt

parent bb8a4c36
@@ -2,6 +2,10 @@
## Code for paper "G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment" [https://arxiv.org/abs/2303.16634]
## Model Evaluations
Score prediction via gemini_eval.py, llama3_eval.py, qwen_eval.py
## Experiments on SummEval dataset
Full dataset used by G-Eval paper
@@ -15,7 +19,7 @@ Sample dataset used for CoT analysis
## Prompts and Evaluation Results
Prompts used to evaluate SummEval with GPT-4 & base and detailed prompts for CoT analysis are in prompts/summeval (by G-Eval paper)
Auto-CoT prompts are in prompts/cot_analysis
Auto-CoT prompts are in prompts/cot_analysis (created via auto_cot.py)
GPT-4 G-eval results on SummEval are in results (by G-Eval paper)
Other models results are in their respective folder
\ No newline at end of file
groq==0.20.0
openai==1.69.0
prettytable==3.16.0
protobuf==6.30.2
scipy==1.15.2
tqdm==4.67.1
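
The pinned versions above can be checked against the active environment after installing them (typically with `pip install -r requirements.txt`). The following is a minimal sketch, not part of this commit: the helper names `parse_pins` and `check_pins` are hypothetical, and it assumes every line uses the plain `name==version` form shown in the diff.

```python
from importlib import metadata

# Pins copied from the requirements.txt added in this commit.
PINS = """\
groq==0.20.0
openai==1.69.0
prettytable==3.16.0
protobuf==6.30.2
scipy==1.15.2
tqdm==4.67.1
"""


def parse_pins(text):
    """Return {package: version} for every 'name==version' line (hypothetical helper)."""
    pins = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        name, _, version = line.partition("==")
        pins[name.strip()] = version.strip()
    return pins


def check_pins(pins):
    """Map each package to 'ok', 'missing', or the installed version that differs from the pin."""
    report = {}
    for name, wanted in pins.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = "missing"
            continue
        report[name] = "ok" if installed == wanted else installed
    return report


if __name__ == "__main__":
    for name, status in check_pins(parse_pins(PINS)).items():
        print(f"{name}: {status}")
```

Any status other than `ok` names either a package that is not installed or the installed version that differs from the pin.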