How does the stackoverflow clause work?

What is the theory of algorithms that, for example, generate sentences on the stackoverflow site for similar questions when you write them? Could you recommend some books on this subject?

+4
source share
2 answers

The algorithms you are talking about are mainly in 3 AI : NLP , ML and IR .

For example, to find the most similar 10 questions of a new question, you can extract n-gram from the texts of each question, calculate TF-IDF for each question of n-grams, then calculate the cosine convergence between the new question and all other questions and select 10 questions with the highest similarity .

Some free books you can read:
http://nlp.stanford.edu/IR-book/
http://infolab.stanford.edu/~ullman/mmds.html

And 2 free courses starting in late January:
http://www.nlp-class.org/
http://jan2012.ml-class.org/

Also (type of participation):
http://see.stanford.edu/see/courseinfo.aspx?coll=63480b48-8819-4efd-8412-263f1a472f5a
http://see.stanford.edu/see/courseinfo.aspx?coll=348ca38a-3a6d-4052-937d-cb017338d7b1

+5
source

I think this is due to the Rule Mining Association based on basket analysis. For a good reference, Web Data Mining by Bing Liu is definitely one of the best.

+1
source

Source: https://habr.com/ru/post/1388381/


All Articles