Implemenation of projects
Spring 2021
-
: Designing and implementation of an information retrieval system for movies dataset
- IR-System
- Main content:
- Preprocessing texts[case folding, stemming, lemmatization, stop-words]
- Positinal-index
- Dynamic indexing
- Index compression[gamma-code, variable-byte]
- Query correction[jaccard, bigram, edit distance]
- Searching and retrieving documents[TF-IDF]
- Evaluation of the system efficiency
-
Machine Learning methods in text-processing
- movies dataset classification and clustering
- main content:
- Preprocessing[TF-IDF]
- Classification[Naive-Bayes, KNN, SVM, Neural-Network]
- Clustering[K-means, Gaussian-Mixture-Models, Hierarchical-Clustering]
-
Recommender System for CS papers in academic microsoft:
- Recommender System
- main content:
- Implementation of a crawler and fetching papers' information
- Ranking papers by PageRank
- Ranking authors by HITS algorithm
- Recommeder system
- Content-based method
- Collaberative filtering
- A simple user interface