I am trying to write an algorithm (which, I believe, will rely on natural language processing methods) to "populate" the list of search terms. There is probably a name for this kind of thing that I don't know about. What is the name of this problem, and what algorithm will give me the following behavior?
Input:
docs = [ "I bought a ticket to the Dolphin Watching cruise", "I enjoyed the Dolphin Watching tour", "The Miami Dolphins lost again!", "It was good going to that Miami Dolphins game" ], search_term = "Dolphin"
Conclusion:
["Dolphin Watching", "Miami Dolphins"]
It should be understood in principle that if Dolphin appears at all, it is almost always either in the Dolphin Observation bigrams or the Miami Dolphins. Solutions in Python are preferable.
source share