Spark SQL and HQL
[cloudera@quickstart ~]$ sudo find / -name 'hive-site.xml'
[cloudera@quickstart ~]$ sudo chmod -R 777 /usr/lib/spark/conf
[cloudera@quickstart ~]$ cp /etc/hive/conf.dist/hive-site.xml /usr/lib/spark/conf
_____________________________________
From hive-site.xml --> hive.metastore.warehouse.dir
From Spark 2.0.0 onwards the above option is deprecated.
Use the following option instead:
------> spark.sql.warehouse.dir
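For Spark 2.x, a minimal sketch of setting this option while building a Hive-enabled session might look like the following. This is not what was tested in this post (which uses Spark 1.6.0). The object name, the app name and the warehouse path /user/hive/warehouse are assumptions for illustration only.

```scala
import org.apache.spark.sql.SparkSession

// Hypothetical standalone application for Spark 2.x (not the 1.6.0 setup tested in this post).
object WarehouseDirExample {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("WarehouseDirExample")
      // Assumed warehouse location; adjust to your cluster.
      .config("spark.sql.warehouse.dir", "/user/hive/warehouse")
      // Use the Hive metastore described by hive-site.xml on the classpath.
      .enableHiveSupport()
      .getOrCreate()

    // List the databases known to the metastore as a quick sanity check.
    spark.sql("SHOW DATABASES").show()

    spark.stop()
  }
}
```

The option has to be set before the session is created, which is why it is passed to the builder rather than changed afterwards.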
_____________________________________________
[ tested in Cloudera 5.8, Spark version 1.6.0 ]
[cloudera@quickstart ~]$ ls /usr/lib/hue/apps/beeswax/data/sample_07.csv
[cloudera@quickstart ~]$ head -n 2 /usr/lib/hue/apps/beeswax/data/sample_07.csv
_____________________
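// Create a HiveContext on top of the existing SparkContext (sc)
val hq = new org.apache.spark.sql.hive.HiveContext(sc)
hq.sql("create database sparkdb")
// Note: the table below is created in the current (default) database
hq.sql("CREATE TABLE sample_07 (code string,description string,total_emp int,salary int) ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t' STORED AS TextFile")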
[cloudera@quickstart ~]$ hadoop fs -mkdir sparks
[cloudera@quickstart ~]$ hadoop fs -copyFromLocal /usr/lib/hue/apps/beeswax/data/sample_07.csv sparks
[cloudera@quickstart ~]$ hadoop fs -ls sparks
// LOAD DATA INPATH moves the HDFS file into the table's warehouse directory
hq.sql("LOAD DATA INPATH '/user/cloudera/sparks/sample_07.csv' OVERWRITE INTO TABLE sample_07")
val df = hq.sql("SELECT * from sample_07")
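To check that the table exists and the load worked, something like the following could be run in the same spark-shell session. It assumes the hq and df values defined above. No expected output is shown here, since these checks were not part of the original run.

```scala
// Assumes the same spark-shell session, with hq and df defined as above.

// List the tables visible to the HiveContext; sample_07 should be among them.
hq.sql("SHOW TABLES").show()

// Print the column names and types that Spark picked up from the Hive table.
df.printSchema()

// Count the rows loaded from sample_07.csv.
println(df.count())

// Peek at the first few rows.
df.show(5)
```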
__________________________________________
scala> df.filter(df("salary") > 150000).show()
+-------+--------------------+---------+------+
|   code|         description|total_emp|salary|
+-------+--------------------+---------+------+
|11-1011|    Chief executives|   299160|151370|
|29-1022|Oral and maxillof...|     5040|178440|
|29-1023|       Orthodontists|     5350|185340|
|29-1024|     Prosthodontists|      380|169360|
|29-1061|   Anesthesiologists|    31030|192780|
|29-1062|Family and genera...|   113250|153640|
|29-1063| Internists, general|    46260|167270|
|29-1064|Obstetricians and...|    21340|183600|
|29-1067|            Surgeons|    50260|191410|
|29-1069|Physicians and su...|   237400|155150|
+-------+--------------------+---------+------+
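Since the table lives in the Hive metastore, the same filter can also be written as an HQL query. A small sketch, assuming the same spark-shell session (hq and df as defined above). The names viaSql and viaFilter are only illustrative.

```scala
// Assumes the same spark-shell session, with hq and df defined as above.

// Same condition expressed as an HQL query against the Hive table.
val viaSql = hq.sql("SELECT * FROM sample_07 WHERE salary > 150000")
viaSql.show()

// The DataFrame filter also accepts the condition as a SQL expression string.
val viaFilter = df.filter("salary > 150000")

// Sort the matching rows by salary, highest first.
viaFilter.orderBy(viaFilter("salary").desc).show()
```

Both forms should return the same rows; which one to use is mostly a matter of whether the rest of the job is written as SQL strings or as DataFrame calls.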
____________________________________________