diff --git a/elise/README.md b/elise/README.md index 54f4b02eee7efbd98994ad4d25eeddcdd667fd9c..9da8853da27334db100a7789a3ea19197f9977c7 100644 --- a/elise/README.md +++ b/elise/README.md @@ -4,19 +4,24 @@ Extracted Literal Search is a simple method for Retrieval Augmented Generation ( For small datasets where most of the data fields are known literals, this method is easier to implement and might provide better results than Similarity Search of embeddings. - ## Installation Make sure you have `poetry` installed. + > pip install --local poetry Install the package. + > poetry install +Export your Huggingface token. + +> export HUGGING_FACE_HUB_TOKEN=<TOKEN> + Run the vLLM server. For example with a quantized Llama-2 model. -> poetry run python -m outlines.serve.serve --model="TheBloke/Llama-2-7b-Chat-GPTQ" -q gptq +> poetry run python -m outlines.serve.serve --model="TheBloke/Llama-2-7b-Chat-GPTQ" -q gptq ## Usage -Define your domain specific prompts. \ No newline at end of file +Define your domain specific prompts.