I have twitter data in avro format. I am trying to insert those data into hive.
Here’s what I did:
- Created a schema of the avro file using the command : avro-tools getschema hdfs://ip-172-31-53-48.ec2.internal:8020/user/tellsanjeev58268/flume/tweets/FlumeData.150
1743718856 > schema.avsc - Moved the schema to HDFS.
- Built a table using the schema in hive. Created Successfully
- Loaded the data into the table. Loaded successfully
While running “select * from table_name” I am getting the following error:
java.io.IOException: org.apache.avro.AvroRuntimeException: java.io.IOException: Block size invalid or too large for this implementation: -40