Skip to content

Latest commit

 

History

History
6 lines (5 loc) · 351 Bytes

README.md

File metadata and controls

6 lines (5 loc) · 351 Bytes

query-doc-matcher

This repo contains the source code to obtain the matching words between a topic (query) and a document. The matching words are computed by taking into account also stopwords removal, stemming and lemmatization. The matching words are ranked by tf-idf scores.

For testing purposes, there is also a demo function: matcher.demo().