integreat-chatbot

We use Haystack to search the documents.

Script 1: Split and format JSON response

A script currently splits the response from an endpoint such as https://cms.integreat-app.de/api/muenchen/de/pages into separate documents and removes the HTML tags. The generated files can easily be converted to Haystack documents and stored. To run the script, use: python3 parse-files.py
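To illustrate the idea of splitting the response and stripping HTML, here is a minimal sketch that uses only the standard library. It is not the code in parse-files.py. The field names "title", "url" and "content" are assumptions about the shape of the API response, and the function names are hypothetical. The output uses "content" and "meta" keys because that is the dictionary shape Haystack 1.x document stores usually accept.

```python
import json
import re
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects the text nodes of an HTML fragment and drops the tags."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def strip_html(html):
    """Return the plain text of an HTML fragment with whitespace collapsed."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()  # flush any text that is still buffered
    # Joining with a space keeps adjacent block elements apart; the regex
    # then collapses the extra whitespace this can introduce.
    return re.sub(r"\s+", " ", " ".join(parser.parts)).strip()


def pages_to_documents(pages):
    """Turn a list of page dicts into one document dict per non-empty page.

    Assumption: every page is a dict with "title", "url" and "content" keys.
    """
    documents = []
    for page in pages:
        text = strip_html(page.get("content", ""))
        if not text:
            continue  # skip pages without any body text
        documents.append({
            "content": text,
            "meta": {"title": page.get("title", ""), "url": page.get("url", "")},
        })
    return documents


if __name__ == "__main__":
    # Hypothetical sample data standing in for the real API response.
    sample = json.loads(
        '[{"title": "Welcome", "url": "/muenchen/de/welcome", '
        '"content": "<p>Hello <b>world</b></p>"}]'
    )
    print(pages_to_documents(sample))
```

Keeping the title and URL in "meta" means a search result can later be traced back to the page it came from.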

Script 2: Run first prototype

To run the first prototype, you need an LLM. Several are available on Hugging Face, for example google/flan-t5-xl.

The script downloads the LLM and saves it in ~/.cache/huggingface, so expect the first run to be significantly slower than later ones. Before running the script, some dependencies need to be installed. To do so, create a virtual environment and activate it:

python3 -m venv .venv
source .venv/bin/activate

Then install the dependencies:

pip install --upgrade pip
pip install farm-haystack==1.17.2
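Since the first run is slow only because the model has to be downloaded, it can help to check whether the model is already in the local cache. The sketch below is one possible way to do that and is not part of the repository. It assumes the cache layout used by recent versions of the Hugging Face libraries: a "hub" directory under ~/.cache/huggingface, or under HF_HOME if that variable is set, with one "models--<org>--<name>" folder per model. The function names are hypothetical.

```python
import os
from pathlib import Path


def hub_cache_dir(env=None):
    """Best-effort guess of the Hugging Face hub cache directory.

    Assumption: HF_HUB_CACHE takes precedence, then HF_HOME/hub, then
    ~/.cache/huggingface/hub. Other overrides are not handled here.
    """
    env = os.environ if env is None else env
    if env.get("HF_HUB_CACHE"):
        return Path(env["HF_HUB_CACHE"])
    hf_home = env.get("HF_HOME") or Path.home() / ".cache" / "huggingface"
    return Path(hf_home) / "hub"


def is_model_cached(model_id, cache_dir=None):
    """Return True if a cache folder for model_id exists.

    This only checks that the folder is present, not that the download
    completed successfully.
    """
    cache_dir = hub_cache_dir() if cache_dir is None else Path(cache_dir)
    folder = "models--" + model_id.replace("/", "--")
    return (cache_dir / folder).is_dir()


if __name__ == "__main__":
    model = "google/flan-t5-xl"
    state = "already cached" if is_model_cached(model) else "not cached yet"
    print(f"{model}: {state}")
```

If the model is reported as not cached, plan for a long first run and enough free disk space for the download.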
