How to save scrapy crawl command output

I am trying to save the output of the scrapy crawl command. I tried scrapy crawl someSpider -o some.json -t json >> some.text but that didn't work ... can someone tell me how I can save the output to a text file ... I mean the logs and the information that scrapy prints ...

5 answers

You need to redirect stderr as well; you are only redirecting stdout. You can do it like this:

scrapy crawl someSpider -o some.json -t json 2> some.text

The key is the number 2, which selects stderr as the source for the redirection.

If you want to redirect stderr and stdout to the same file, you can use:

scrapy crawl someSpider -o some.json -t json &> some.text

More on output redirection: http://tldp.org/HOWTO/Bash-Prog-Intro-HOWTO-3.html
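Note that &> is a Bash shortcut; in a plain POSIX shell (sh, dash) the portable way to send both streams to the same file is to redirect stdout first and then point stderr at it. A minimal sketch, reusing the same example spider and file names as above:

 # portable form: stdout goes to the file, then stderr is sent wherever stdout points
 scrapy crawl someSpider -o some.json -t json > some.text 2>&1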


You can add these lines to your settings.py:

 LOG_STDOUT = True
 LOG_FILE = '/tmp/scrapy_output.txt'

Then run the crawl as usual:

 scrapy crawl someSpider 
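If you would rather not edit settings.py, the same settings can also be overridden for a single run with Scrapy's -s (--set) command-line option. A small sketch, with an example file path:

 # override the logging settings for this run only
 scrapy crawl someSpider -s LOG_FILE=/tmp/scrapy_output.txt -s LOG_STDOUT=True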

If you want to capture the output of the runspider command:

 scrapy runspider scraper.py -o some.json -t json 2> some.text 

This also works.


You can use nohup:

 nohup scrapy crawl someSpider & 

The log will be stored in nohup.out.
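By default nohup appends both stdout and stderr to nohup.out when they would otherwise go to the terminal. If you want the log in a file of your own choosing, you can combine nohup with an explicit redirect (the file name below is just an example):

 # keep the crawl running after logout and collect everything in one file
 nohup scrapy crawl someSpider > some.text 2>&1 &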


For any scrapy command you can add --logfile NAME_OF_FILE to write the log to a file, for example:

 scrapy crawl someSpider -o some.json --logfile some.text 

There are two other useful command line options for logging:

  • -L or --loglevel to control the logging level, e.g. -L INFO (the default is DEBUG)

  • --nolog to completely disable logging

These options are described in the Scrapy command-line tool documentation.
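Putting the options together, a run that writes the scraped items to JSON and keeps an INFO-level log in a separate file could look like this (spider and file names are placeholders):

 scrapy crawl someSpider -o some.json --logfile some.text -L INFO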


Source: https://habr.com/ru/post/945418/

