Questions tagged [apache-zeppelin]

Apache Zeppelin is a web-based notebook that enables data-driven interactive data analytics. You can make beautiful data-driven, interactive and collaborative documents with SQL, Python, Scala and more. It also supports Markdown syntax.

Apache Zeppelin home page

1379 questions
27
votes
1 answer

No interpreters available in Zeppelin

I have just installed the following on my Mac (Yosemite 10.10.3): oracle java 1.8 update 45 scala 2.11.6 spark 1.4 (precompiled release: http://d3kbcqa49mib13.cloudfront.net/spark-1.4.0-bin-hadoop2.6.tgz) zeppelin from source…
Samuel Kerrien
  • 6,527
  • 2
  • 25
  • 32
21
votes
1 answer

What to set `SPARK_HOME` to?

Installed apache-maven-3.3.3, scala 2.11.6, then ran: $ git clone git://github.com/apache/spark.git -b branch-1.4 $ cd spark $ build/mvn -DskipTests clean package Finally: $ git clone https://github.com/apache/incubator-zeppelin $ cd…
A T
  • 10,508
  • 14
  • 85
  • 137
20
votes
4 answers

Remove Temporary Tables from Apache SQL Spark

I have registertemptable in Apache Spark using Zeppelin below: val hvacText = sc.textFile("...") case class Hvac(date: String, time: String, targettemp: Integer, actualtemp: Integer, buildingID: String) val hvac = hvacText.map(s =>…
19
votes
2 answers

How can I pretty print a data frame in Zeppelin/Spark/Scala?

I am using Spark 2 and Scala 2.11 in a Zeppelin 0.7 notebook. I have a dataframe that I can print like this: dfLemma.select("text", "lemma").show(20,false) and the output looks…
schoon
  • 1,878
  • 3
  • 26
  • 56
19
votes
4 answers

Why does SparkContext randomly close, and how do you restart it from Zeppelin?

I am working in Zeppelin writing spark-sql queries and sometimes I suddenly start getting this error (after not changing code): Cannot call methods on a stopped SparkContext. Then the output says further down: The currently active SparkContext was…
The Puma
  • 1,242
  • 2
  • 13
  • 27
19
votes
6 answers

How to suppress printing of variable values in zeppelin

Given the following snippet: val data = sc.parallelize(0 until 10000) val local = data.collect println(s"local.size") Zeppelin prints out the entire value of local to the notebook screen. How may that behavior be changed?
StephenBoesch
  • 46,509
  • 64
  • 237
  • 432
17
votes
1 answer

How to check version of Spark and Scala in Zeppelin?

When I run interactive spark-shell, I show spark version (2.2.0) and scala version (2.11.8) Welcome to ____ __ / __/__ ___ _____/ /__ _\ \/ _ \/ _ `/ __/ '_/ /___/ .__/\_,_/_/ /_/\_\ version 2.2.0 /_/ Using…
Haha TTpro
  • 3,921
  • 5
  • 28
  • 54
17
votes
9 answers

Getting NullPointerException when running Spark Code in Zeppelin 0.7.1

I have installed Zeppelin 0.7.1. When I tried to execute the Example spark program(which was available with Zeppelin Tutorial notebook), I am getting the following error java.lang.NullPointerException at…
Raj
  • 2,185
  • 5
  • 27
  • 46
17
votes
3 answers

converting pandas dataframes to spark dataframe in zeppelin

I am new to zeppelin. I have a usecase wherein i have a pandas dataframe.I need to visualize the collections using in-built chart of zeppelin I do not have a clear approach here. MY understanding is with zeppelin we can visualize the data if it is a…
Bala
  • 655
  • 2
  • 6
  • 21
15
votes
2 answers

What is the difference between yarn and yarn install?

I am trying to install an additional plugin on Apache Zeppelin using Helium. The Installation command using by Helium to download this package is below: com.github.eirslett.maven.plugins.frontend.lib.TaskRunnerException: 'yarn install…
gnish9
  • 187
  • 1
  • 7
15
votes
2 answers

Does Apache Zeppelin have intellisense?

Does Apache Zeppelin have intellisense/auto-completion support in its notebook ui? if so how do I use it? I tried searching on the internet but was unsuccessful to find a source which works. This https://github.com/NFLabs/zeppelin/issues/248 says…
DesirePRG
  • 5,584
  • 12
  • 59
  • 99
15
votes
11 answers

Hello world in zeppelin failed

I just installed apache zeppelin (built from latest source from git repo) and successfully saw it is up and running in the port 10008. I created a new note book with a single line of code val a = "Hello World!" And run this paragraph and saw the…
Bala
  • 655
  • 2
  • 6
  • 21
14
votes
4 answers

Spark + s3 - error - java.lang.ClassNotFoundException: Class org.apache.hadoop.fs.s3a.S3AFileSystem not found

I have a spark ec2 cluster where I am submitting a pyspark program from a Zeppelin notebook. I have loaded the hadoop-aws-2.7.3.jar and aws-java-sdk-1.11.179.jar and place them in the /opt/spark/jars directory of the spark instances. I get a…
peterlandis
  • 297
  • 1
  • 2
  • 10
14
votes
1 answer

Field "features" does not exist. SparkML

I am trying to build a model in Spark ML with Zeppelin. I am new to this area and would like some help. I think i need to set the correct datatypes to the column and set the first column as the label. Any help would be greatly appreciated, thank…
Young4844
  • 227
  • 1
  • 3
  • 12
13
votes
4 answers

Zeppelin: How to restart sparkContext in zeppelin

I am using Isolated mode of zeppelins spark interpreter, with this mode it will start a new job for each notebook in spark cluster. I want to kill the job via zeppelin when the notebook execution is completed. For this I did sc.stop this stopped the…
eatSleepCode
  • 3,911
  • 6
  • 37
  • 77
1
2 3
91 92