I have a list of names, some of them are fake, I need to use NLP and Python 3.1 to keep the real names and throw out the fake names

I don’t know where to start. I have never done NLP and only programmed in Python 3.1, which I should use. I look at http://www.linkedin.com and I need to collect all public profiles, and some of them have very fake names, like 'aaaaaa k dudujjek', and I was told that I can use NLP to find real names, where would I even start?

+3
source share
3 answers

, .

, , ? . "" , , .

LinkedIn , , /. -, , IMDB (, ) .

, : , , - . , , .

+3

, HMM, .. . NLTK [ ] HMM, , .

: AFAIK, NTLK Python 3.0

, , , , , , , , . , ( ) , , .

+1

, , "" - , , , . , , , , , , , , . : " - , " ", . , " ", .

, . . , , , , . , , .

, Google . google , , . , . , , .

0

Source: https://habr.com/ru/post/1735863/


All Articles