当 运行 python 用于 Kmeans 时,行继续后出现意外字符
Unexpected character after line continuation when running python for Kmeans
我在 Spark
中实现 K-Means
,当我尝试使用 spark-submit Kmeans.py
运行 我的脚本时,我不断收到错误 unexpected character after line continuation
。我正在使用反斜杠进行续行,但我看不出我是如何使用它的。下面我将 post 我的算法。
import pyspark
from pyspark.context import SparkContext
from pyspark import SparkConf
from pyspark.sql import SparkSession, SQLContext, Row
from pyspark.sql.functions import *
from pyspark.ml.clustering import KMeans
import json
import os
conf = SparkConf()
sc = SparkContext(conf = conf)
sc.setLogLevel("ERROR")
spark = SparkSession \
.builder \
.appName("Phone Book - Country Look up") \
.config("spark.some.config.option", "some-value") \
.getOrCreate()
dataset = spark.read.format("libsvm") \
.load("/home/jay/Assignment6/Input.txt")
dataset.show(200)
dataset.printSchema()
kmeans = KMeans().setK(2).setSeed(1)
model = kmeans.fit(dataset)
var1 = model.computeCost(dataset)
print("Within Set Sum of Squared Errors =" + str(var1))
centers = model.clusterCenters()
print("Cluster centers: ")
for center in centers:
print(center)
我认为你打错了:.builder
应该是 .builder()
。
连续字符 之后出现意外字符几乎总是意味着在\
之后有一个space,这当然很难看到。如果它在第 15 行报告,则问题可能在第 14 行。您可以通过删除连续字符并将表达式括在括号中来解决此类问题:
spark =(SparkSession
.builder
.appName("Phone Book - Country Look up")
.config("spark.some.config.option", "some-value")
.getOrCreate())
我在 Spark
中实现 K-Means
,当我尝试使用 spark-submit Kmeans.py
运行 我的脚本时,我不断收到错误 unexpected character after line continuation
。我正在使用反斜杠进行续行,但我看不出我是如何使用它的。下面我将 post 我的算法。
import pyspark
from pyspark.context import SparkContext
from pyspark import SparkConf
from pyspark.sql import SparkSession, SQLContext, Row
from pyspark.sql.functions import *
from pyspark.ml.clustering import KMeans
import json
import os
conf = SparkConf()
sc = SparkContext(conf = conf)
sc.setLogLevel("ERROR")
spark = SparkSession \
.builder \
.appName("Phone Book - Country Look up") \
.config("spark.some.config.option", "some-value") \
.getOrCreate()
dataset = spark.read.format("libsvm") \
.load("/home/jay/Assignment6/Input.txt")
dataset.show(200)
dataset.printSchema()
kmeans = KMeans().setK(2).setSeed(1)
model = kmeans.fit(dataset)
var1 = model.computeCost(dataset)
print("Within Set Sum of Squared Errors =" + str(var1))
centers = model.clusterCenters()
print("Cluster centers: ")
for center in centers:
print(center)
我认为你打错了:.builder
应该是 .builder()
。
连续字符 之后出现意外字符几乎总是意味着在\
之后有一个space,这当然很难看到。如果它在第 15 行报告,则问题可能在第 14 行。您可以通过删除连续字符并将表达式括在括号中来解决此类问题:
spark =(SparkSession
.builder
.appName("Phone Book - Country Look up")
.config("spark.some.config.option", "some-value")
.getOrCreate())