I ran into this same error and eventually found a solution. However, I changed two things at the same time, so I'm not sure that both of them are necessary (at least one of them certainly is).
First, I made sure that my S3 data and my EMR cluster are in the same region. When I had this problem, my data was in US East and EMR was in US West. I standardized on US East (N. Virginia), aka us-east-1, aka US Standard, aka DEFAULT. This may not have been required, but it didn't hurt.
Second, when I got the error, I had started Pig by following the steps in one of the videos and passed it the "-x local" option. It turns out that "-x local" seems to be guaranteed to prevent access to S3 (see below).
The solution is to start Pig without any parameters.
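As a rough illustration of that rule, here is a minimal Python sketch that picks the Pig launch command from the input paths a script reads. The function name `pig_command`, the example bucket and file paths, and the set of schemes treated as S3 are assumptions made for this sketch. It only encodes what I observed in this answer, not any documented Pig behaviour.

```python
from urllib.parse import urlparse

# Schemes treated as S3 storage; this set is an assumption for the sketch.
S3_SCHEMES = {"s3", "s3n", "s3a"}


def pig_command(paths):
    """Return the argv for launching Pig, given the input paths a script uses.

    If any path points at S3, launch plain `pig` (cluster mode), because
    `-x local` failed to read S3 in the session shown in this answer.
    """
    needs_cluster = any(urlparse(p).scheme in S3_SCHEMES for p in paths)
    if needs_cluster:
        return ["pig"]
    return ["pig", "-x", "local"]


# Hypothetical paths, for illustration only.
print(pig_command(["s3://example-bucket/rawdata.csv"]))
print(pig_command(["/home/hadoop/sample.csv"]))
```

The returned list could be handed to something like `subprocess.run` on the master node. The point is simply that local mode is chosen only when nothing in the script touches S3.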
Hope this helps.
Gilles
hadoop@domU-12-31-39-09-24-66:~$ pig -x local
2013-07-03 00:27:15,321 [main] INFO org.apache.pig.Main - Apache Pig version 0.11.1-amzn (rexported) compiled Jun 24 2013, 18:37:44
2013-07-03 00:27:15,321 [main] INFO org.apache.pig.Main - Logging error messages to: /home/hadoop/pig_1372811235317.log
2013-07-03 00:27:15,379 [main] INFO org.apache.pig.impl.util.Utils - Default bootup file /home/hadoop/.pigbootup not found
2013-07-03 00:27:15,793 [main] INFO org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - Connecting to hadoop file system at: file:///
grunt> ls s3://xxxxxx.xx.rawdata
2013-07-03 00:27:23,463 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2999: Unexpected internal error. AWS Access Key ID and Secret Access Key must be specified as the username or password (respectively) of a s3 URL, or by setting the fs.s3.awsAccessKeyId or fs.s3.awsSecretAccessKey properties (respectively).
Details at logfile: /home/hadoop/pig_1372811235317.log
grunt> quit
hadoop@domU-12-31-39-09-24-66:~$ pig
2013-07-03 00:28:04,769 [main] INFO org.apache.pig.Main - Apache Pig version 0.11.1-amzn (rexported) compiled Jun 24 2013, 18:37:44
2013-07-03 00:28:04,771 [main] INFO org.apache.pig.Main - Logging error messages to: /home/hadoop/pig_1372811284764.log
2013-07-03 00:28:04,873 [main] INFO org.apache.pig.impl.util.Utils - Default bootup file /home/hadoop/.pigbootup not found
2013-07-03 00:28:05,639 [main] INFO org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - Connecting to hadoop file system at: hdfs://10.210.43.148:9000
2013-07-03 00:28:08,765 [main] INFO org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - Connecting to map-reduce job tracker at: 10.210.43.148:9001
grunt> ls s3://xxxxxx.xx.rawdata
s3://xxxxxx.xx.rawdata/rawdata<r 1> 19813
s3://xxxxxx.xx.rawdata/rawdata.csv<r 1> 19813
grunt>