How to transfer a file using curl from one server to another (limited server resources)

My API server has very limited disk space (500 MB) and memory (1 GB). One of the API calls it handles is fetching a file: the user calls the API and passes a download URL.

My server's "goal" is to upload this file to Amazon S3. Unfortunately, I cannot ask the user to upload the file directly to S3 (that's part of the requirements).

The problem is that sometimes these are huge files (10 GB), and saving them to disk and then uploading to S3 is not an option (500 MB disk limit).

My question is: how can I "pipe" the file from the input URL to S3 using curl on Linux?

Note: I managed to transfer it in various ways, but either it tries to download the whole file first (which does not work), or I hit a memory error and curl crashes. I assume the download is much faster than the upload, so the pipe buffer / memory grows and blows up (the server has 1 GB of memory) when I receive 10 GB files.

Is there a way to achieve what I'm trying to do using curl and piping?

Thanks, Jack

1 answer

Another SO user asked a similar question about having curl send its upload data from stdin. See "using pipe for curl data".
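
A minimal sketch of that pattern, assuming SOURCE_URL is the user-supplied download URL and S3_PRESIGNED_URL is a presigned PUT URL for the destination object (both are placeholders, not anything from the question):

```bash
# First curl streams the download to stdout; the second curl reads stdin
# (-T -) and uploads it with a PUT, so the file never touches local disk.
curl -fsSL "$SOURCE_URL" \
  | curl -fsS -T - "$S3_PRESIGNED_URL"
```

Note that uploading from stdin means curl cannot send a Content-Length header, so the receiving end has to accept a chunked upload; for multi-gigabyte objects on plain S3 PUTs you may need a multipart-capable client instead.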

Once you can feed your upload stream from the output of the first curl's download stream, and if you still run into memory problems because you download faster than you can upload, take a look at mbuffer. I have not used it myself, but it seems to be designed specifically for this kind of problem.
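
For example, the same pipeline with mbuffer sitting between the two curls (same placeholder URLs as above; 256M is just an illustrative buffer size that fits within the 1 GB of RAM mentioned in the question):

```bash
# mbuffer absorbs short bursts where the download outpaces the upload.
# -m 256M caps the in-memory buffer; -q suppresses the progress display.
curl -fsSL "$SOURCE_URL" \
  | mbuffer -q -m 256M \
  | curl -fsS -T - "$S3_PRESIGNED_URL"
```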

Finally, if all else fails, I think you could use curl's --limit-rate option to clamp the download and upload transfer rates to the same sustainable value. This potentially under-utilizes bandwidth and will not scale well with multiple concurrent upload/download streams, but for a one-off batch process it might be enough.
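
A sketch of that approach, capping both sides at an arbitrary 10 MB/s (again with the placeholder URLs from above):

```bash
# Limiting both transfers to the same rate keeps the pipe from backing up.
curl -fsSL --limit-rate 10M "$SOURCE_URL" \
  | curl -fsS --limit-rate 10M -T - "$S3_PRESIGNED_URL"
```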



