Figuring out where to add punctuation to bad user content?

Is there a way to use NLP or an existing library to add missing punctuation to bad user content?

For example, this line:

Today is Tuesday I went to work on Monday Friday was off

will become:

Today is Tuesday. I went to work on Monday. Friday was off.

+3
source share
2 answers

I think this problem falls under the uncertainty of the scope of the proposal http://en.wikipedia.org/wiki/Sentence_boundary_disambiguation . I used the OpenNLP option and was pleased with the results.

+1
source

I briefly talked about this problem (only with partial success).

; , , , , @Rahul . , . , :

, , , .

, , . , ?

, ( ).

, n-gram . LingPipe - . , ( ), , . : , 8-12 , ; , , .

, , , , . , (, ) ( n ).

+1

Source: https://habr.com/ru/post/1536410/


All Articles