I fetch and cache (for performance) a lot of URLs with something like:
import requests
import requests_cache
from multiprocessing.pool import ThreadPool

urls = ['http://www.google.com', ...]
with requests_cache.enabled():
    responses = ThreadPool(100).map(requests.get, urls)
However, many of the requests fail with:
sqlite3.OperationalError: database is locked
Clearly, too many threads are accessing the cache at the same time.
So, does requests_cache support some kind of transaction, so that writes to the cache only happen once all threads have finished? For instance:
with requests_cache.enabled():
    with requests_cache.transaction():
        responses = ThreadPool(100).map(requests.get, urls)