Mastercard Interview Question | Search Engine
Question
A person using a search engine needs to find something. How do you come up with an algorithm that will predict what the user needs after they type only a few letters?
Machine Learning
4 years
Answer ( 1 )
Basically I would handle this as a text classification problem:
Here are the preprocessing steps I would follow to improve prediction quality:
1.) Use a good tokenizer (TextBlob, Stanford Tokenizer).
2.) Try lemmatization; stemming does not always perform well on news articles.
3.) Word segmentation.
4.) Normalization (equivalence classing of terms).
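The four steps above can be sketched roughly as follows. This is only a minimal illustration using the Python standard library: the lemma table `TOY_LEMMAS`, the vocabulary `TOY_VOCAB`, and all function names are hypothetical stand-ins, and in practice the tokenizer and lemmatizer would come from a library such as TextBlob or the Stanford tools.

```python
import re
import unicodedata

# Hypothetical toy lemma table; a real system would call a library lemmatizer.
TOY_LEMMAS = {"running": "run", "ran": "run", "articles": "article"}

# Hypothetical vocabulary, used only to split run-together words.
TOY_VOCAB = {"new", "york", "news", "times", "weather"}

def normalize(text):
    """Lowercase and strip accents so equivalent spellings share one form."""
    decomposed = unicodedata.normalize("NFKD", text.lower())
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))

def tokenize(text):
    """Keep runs of lowercase letters, digits, and apostrophes as tokens."""
    return re.findall(r"[a-z0-9']+", text)

def segment(token, vocab=TOY_VOCAB):
    """Greedy longest-match split; returns the token unchanged if it fails."""
    parts, i = [], 0
    while i < len(token):
        for j in range(len(token), i, -1):
            if token[i:j] in vocab:
                parts.append(token[i:j])
                i = j
                break
        else:
            return [token]  # no vocabulary word fits here, keep the token whole
    return parts

def preprocess(text):
    """Normalize, tokenize, segment, then map each piece to its lemma."""
    tokens = []
    for tok in tokenize(normalize(text)):
        for piece in segment(tok):
            tokens.append(TOY_LEMMAS.get(piece, piece))
    return tokens
```

With these toy tables, `preprocess("NewYork Café articles")` returns `['new', 'york', 'cafe', 'article']`.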
For selecting a model:
1.) In your example above, we classified the document by comparing the number of matching terms in the document vectors. In practice, many more sophisticated classification algorithms exist, such as Support Vector Machines (SVMs), Naive Bayes, Decision Trees, and Maximum Entropy.
2.) You can also treat the problem as clustering news articles and extracting the semantic relationship of a source article from those clusters. Try topic modelling (LDA and LSA) or Doc2vec/word2vec to get vectors for documents and words, then use those vectors for the classification task.
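To make the classification framing concrete, here is a small sketch of one of the models named above, Naive Bayes, applied to a few typed letters. It is written from scratch rather than with a library, and it rests on several assumptions of mine: the features are character bigrams, the `history` query log and its topic labels are invented for illustration, and the class and function names are hypothetical.

```python
import math
from collections import Counter, defaultdict

def char_ngrams(text, n=2):
    """Character n-grams with a start marker so the first letters carry weight."""
    padded = "^" + text.lower()
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]

class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.feature_counts = defaultdict(Counter)
        self.vocab = set()
        for text, label in zip(texts, labels):
            grams = char_ngrams(text)
            self.feature_counts[label].update(grams)
            self.vocab.update(grams)
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            counts = self.feature_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_docs / total_docs)  # log prior
            for gram in char_ngrams(text):
                score += math.log((counts[gram] + 1) / denom)  # smoothed likelihood
            if score > best_score:
                best_label, best_score = label, score
        return best_label

# Hypothetical query log: past queries labelled with the topic the user wanted.
history = [
    ("weather today", "weather"),
    ("weather forecast", "weather"),
    ("west coast weather", "weather"),
    ("football scores", "sports"),
    ("football fixtures", "sports"),
    ("formula one results", "sports"),
]
texts, labels = zip(*history)
model = NaiveBayes().fit(texts, labels)
```

On this toy log, `model.predict("wea")` returns `"weather"` and `model.predict("foo")` returns `"sports"`. The same `fit`/`predict` shape would also accept document vectors from LDA, LSA, or Doc2vec in place of the bigram counts, though that swap is not shown here.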