美文网首页
org.apache.spark.SparkException:

org.apache.spark.SparkException:

作者: frank3 | 来源:发表于2020-03-19 16:12 被阅读0次

使用sbin/start-thriftserver.sh --help 报错如下(重点报错信息)

20/03/19 15:08:22 ERROR SparkContext: Error initializing SparkContext.
org.apache.spark.SparkException: A master URL must be set in your configuration
    at org.apache.spark.SparkContext.<init>(SparkContext.scala:368)
    at org.apache.spark.SparkContext$.getOrCreate(SparkContext.scala:2520)
    at org.apache.spark.sql.SparkSession$Builder$$anonfun$7.apply(SparkSession.scala:935)
    at org.apache.spark.sql.SparkSession$Builder$$anonfun$7.apply(SparkSession.scala:926)
    at scala.Option.getOrElse(Option.scala:121)
    at org.apache.spark.sql.SparkSession$Builder.getOrCreate(SparkSession.scala:926)
    at org.apache.spark.sql.hive.thriftserver.SparkSQLEnv$.init(SparkSQLEnv.scala:48)
    at org.apache.spark.sql.hive.thriftserver.HiveThriftServer2$.main(HiveThriftServer2.scala:79)
    at org.apache.spark.sql.hive.thriftserver.HiveThriftServer2.main(HiveThriftServer2.scala)
20/03/19 15:08:22 ERROR Utils: Uncaught exception in thread main

从源码中查找问题(spark2.4.5 SparkContext.scala 367行)

if (!_conf.contains("spark.master")) {
      throw new SparkException("A master URL must be set in your configuration")
    }

查看sbin/start-thriftserver.sh内容

CLASS="org.apache.spark.sql.hive.thriftserver.HiveThriftServer2"
......此次省略
"${SPARK_HOME}"/bin/spark-submit --help 2>&1 | grep -v Usage 1>&2
  echo
  echo "Thrift server options:"
  "${SPARK_HOME}"/bin/spark-class $CLASS --help 2>&1 | grep -v "$pattern" 1>&2

使用bin/spark-class -Dspark.master=yarn org.apache.spark.sql.hive.thriftserver.HiveThriftServer2 --help出了新的错误

20/03/19 15:20:10 DEBUG AbstractService: Service: org.apache.hadoop.yarn.client.api.impl.YarnClientImpl entered state INITED
Exception in thread "main" java.lang.NoClassDefFoundError: com/sun/jersey/api/client/config/ClientConfig
    at org.apache.hadoop.yarn.client.api.TimelineClient.createTimelineClient(TimelineClient.java:55)
    at org.apache.hadoop.yarn.client.api.impl.YarnClientImpl.createTimelineClient(YarnClientImpl.java:181)
    at org.apache.hadoop.yarn.client.api.impl.YarnClientImpl.serviceInit(YarnClientImpl.java:168)
    at org.apache.hadoop.service.AbstractService.init(AbstractService.java:163)
    at org.apache.spark.deploy.yarn.Client.submitApplication(Client.scala:161)
    at org.apache.spark.scheduler.cluster.YarnClientSchedulerBackend.start(YarnClientSchedulerBackend.scala:57)
    at org.apache.spark.scheduler.TaskSchedulerImpl.start(TaskSchedulerImpl.scala:183)
    at org.apache.spark.SparkContext.<init>(SparkContext.scala:501)
    at org.apache.spark.SparkContext$.getOrCreate(SparkContext.scala:2520)
    at org.apache.spark.sql.SparkSession$Builder$$anonfun$7.apply(SparkSession.scala:935)
    at org.apache.spark.sql.SparkSession$Builder$$anonfun$7.apply(SparkSession.scala:926)
    at scala.Option.getOrElse(Option.scala:121)
    at org.apache.spark.sql.SparkSession$Builder.getOrCreate(SparkSession.scala:926)
    at org.apache.spark.sql.hive.thriftserver.SparkSQLEnv$.init(SparkSQLEnv.scala:48)
    at org.apache.spark.sql.hive.thriftserver.HiveThriftServer2$.main(HiveThriftServer2.scala:79)
    at org.apache.spark.sql.hive.thriftserver.HiveThriftServer2.main(HiveThriftServer2.scala)
Caused by: java.lang.ClassNotFoundException: com.sun.jersey.api.client.config.ClientConfig
    at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
    at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:338)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
    ... 16 more

进过查找,发现需要配置spark.hadoop.yarn.timeline-service.enabled=false 参考[spark 2.0 on yarn 问题]
(https://blog.csdn.net/ruiyiin/article/details/76530376
), 具体配置功能参见Hadoop YARN Timeline Service Integrationhttps://github.com/steveloughran/spark-timeline-integration/blob/master/yarn-timeline-history/src/main/docs/timeline.md

bin/spark-class -Dspark.hadoop.yarn.timeline-service.enabled=false -Dspark.master=yarn org.apache.spark.sql.hive.thriftserver.HiveThriftServer2 --help

相关文章

网友评论

      本文标题:org.apache.spark.SparkException:

      本文链接:https://www.haomeiwen.com/subject/hallyhtx.html