PySpark: display a Spark data frame in a table format

Question

I am using pyspark to read a parquet file like below:

my_df = sqlContext.read.parquet('hdfs://myPath/myDB.db/myTable/**')

Then when I call my_df.take(5), it shows [Row(...)] instead of a table like the one we get with a pandas data frame.

Is it possible to display the data frame in a table format like pandas data frame? Thanks!

Answer 1

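The show method prints a data frame as a table. For example, calling df.show(n=2) on a data frame with columns k and v prints:

+---+---+
|  k|  v|
+---+---+
|foo|  1|
|bar|  2|
+---+---+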
only showing top 2 rows
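
As a minimal sketch of the two usual ways to get a table-style view, the helpers below wrap DataFrame.show (which prints an ASCII table) and DataFrame.limit followed by toPandas (which returns a pandas data frame). The names show_rows, to_pandas_head and main are hypothetical and used only for illustration. The sample data in main is an assumption modelled on the output above, and toPandas assumes pandas is installed on the driver.

```python
def show_rows(df, n=5, truncate=True):
    """Print the first n rows of a Spark DataFrame as an ASCII table."""
    # DataFrame.show prints to stdout and returns None.
    df.show(n=n, truncate=truncate)


def to_pandas_head(df, n=5):
    """Return the first n rows as a pandas DataFrame."""
    # limit() is applied first so that only n rows are collected to the driver.
    return df.limit(n).toPandas()


def main():
    # Illustrative only: requires pyspark (and pandas for toPandas).
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.master("local[1]").appName("show-demo").getOrCreate()
    # Assumed sample data; replace with your own DataFrame, e.g. my_df.
    df = spark.createDataFrame([("foo", 1), ("bar", 2), ("baz", 3)], ["k", "v"])
    show_rows(df, 2)                # ASCII table printed by Spark
    print(to_pandas_head(df, 2))    # pandas-style table
    spark.stop()
```

Calling main() runs the demo on a local Spark session. With the data frame from the question, show_rows(my_df, 5) takes the place of my_df.take(5). Passing truncate=False keeps long cell values from being cut off.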
