I am developing a Ruby parser that handles some irregular text data. Can someone tell me where I can get a lot of open text data?
This page lists many sources:
http://www.quora.com/Data/Where-can-I-get-large-datasets-open-to-the-public
And my favorite:
http://ftp.sunet.se/mirror/archive/ftp.sunet.se/pub/tv+movies/imdb/
You can download Wikipedia (or just run a bunch of its pages through lynx -dump). It will also give you an extensive source of non-English text. Project Gutenberg is another good source of a lot of plain text.
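Since the parser is in Ruby, the lynx -dump suggestion could be scripted roughly like this. It is only a sketch: it assumes lynx is installed and on your PATH, and the helper names (lynx_dump_command, dump_page, corpus_filename) and the example URL are made up for illustration. The -nolist flag is optional; it just drops the numbered link index that lynx appends to a dump.

```ruby
require "open3"

# Build the argument list for lynx. -dump prints the rendered page as
# plain text; -nolist omits the trailing list of link references.
def lynx_dump_command(url)
  ["lynx", "-dump", "-nolist", url]
end

# Hypothetical helper: turn a URL into a safe file name for the corpus.
def corpus_filename(url)
  url.gsub(/[^A-Za-z0-9]+/, "_") + ".txt"
end

# Fetch one page as plain text by shelling out to lynx.
# Raises if lynx exits with a non-zero status.
def dump_page(url)
  text, status = Open3.capture2(*lynx_dump_command(url))
  raise "lynx failed for #{url}" unless status.success?
  text
end

# Example use (needs network access and lynx installed):
#   url = "https://en.wikipedia.org/wiki/Ruby"
#   File.write(corpus_filename(url), dump_page(url))
```

For a large corpus, the Wikipedia database dumps are kinder to the servers than fetching pages one by one.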
Source: https://habr.com/ru/post/886677/