Hi,
I have a hive table define with tblproperties(“skip.header.line.count”=“1”) .
When I query (select *) from hive itself, its showing proper result but when I execute the select from pyspark, the header is not skipped.
Could you please help.
df=sqlContext.sql(“select * from sde_db.cust_data_master”)
df.show(2)
±------±---------±---------±--------±------------±------±----±----±-----±-----------±-----------±-------------------±------+
|cust_id| biz_dt|first_name|last_name| address| city|state| post|phone1| phone2| email| web|country|
±------±---------±---------±--------±------------±------±----±----±-----±-----------±-----------±-------------------±------+
| null| null|first_name|last_name| address|country| city|state| post| phone1| phone2| email| au|
| 1|2018-01-09| Rebbecca| Didio|171 E 24th St| AU|Leith| TA| 7315|03-8174-9123|0458-665-290|rebbecca.didio@di…| au|
±------±---------±---------±--------±------------±------±----±----±-----±-----------±-----------±-------------------±------+