/// <summary>
/// Creates a <see cref="DataFrame"/> from a RDD containing array of object using the given schema.
/// </summary>
/// <param name="rdd">RDD containing array of object. The array acts as a row and items within the array act as columns which the schema is specified in <paramref name="schema"/>.</param>
/// <param name="schema">The schema of DataFrame.</param>
/// <returns>A <see cref="DataFrame"/> built from <paramref name="rdd"/> with the columns described by <paramref name="schema"/>.</returns>
public DataFrame CreateDataFrame(RDD<object[]> rdd, StructType schema)
{
    // Why the no-op MapPartitions: sparkSessionProxy.CreateDataFrame() routes into
    // byteArrayRDDToAnyArrayRDD() in SQLUtils.scala, which only accepts RDD[Array[Byte]].
    // Wrapping the input in an identity MapPartitions sends it through CSharpWorker,
    // where pickling produces the required RDD<byte[]>. SerDeUtil.pythonToJava() (called
    // by byteArrayRDDToAnyArrayRDD) is itself a mapPartitions, so nothing executes until
    // the CSharpWorker has finished pickling.
    var pickledRdd = rdd.MapPartitions(partition => partition.Select(row => row));
    pickledRdd.serializedMode = SerializedMode.Row;

    return new DataFrame(
        sparkSessionProxy.CreateDataFrame(pickledRdd.RddProxy, schema.StructTypeProxy),
        SparkContext);
}