Information_Retrieval_and_Web_Search

TF-IDF index construction, query split computation, named entity disambiguation / named entity linking

Project Part1

Calculating TF-IDF for tokens and entities in the documents.

Split queries into different combinations of tokens and entities.

Calculate the query score for different combinations in order to select the one with highest score.

Check the Jupyter file inside to see the specification for part 1.

Project Part2

Use XGBoost with feature selection to build a model for Named Entity Disambiguation, applying TF-IDF and other NLP methods.

Train the model in Train.py and test the accuracy with test.py.

Check the Jupyter file inside to see the specification for part 2.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
Project_Part1		Project_Part1
Project_Part2		Project_Part2
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Information_Retrieval_and_Web_Search

TF-IDF index construction, query split computation, named entity disambiguation / named entity linking

Project Part1

Project Part2

About

Releases

Packages

Languages

miloooooz/Information_Retrieval_and_Web_Search

Folders and files

Latest commit

History

Repository files navigation

Information_Retrieval_and_Web_Search

TF-IDF index construction, query split computation, named entity disambiguation / named entity linking

Project Part1

Project Part2

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages