Example #1
        /// <summary>
        /// Randomly splits this DataFrame with the provided weights.
        /// Reference: https://github.com/apache/spark/blob/branch-1.4/python/pyspark/sql/dataframe.py, randomSplit(self, weights, seed=None)
        /// </summary>
        /// <param name="weights">list of weights with which to split the DataFrame. Weights will be normalized if they don't sum up to 1.0</param>
        /// <param name="seed">The seed for sampling</param>
        /// <returns>An enumerable of DataFrames, one per weight, containing the split rows</returns>
        public IEnumerable<DataFrame> RandomSplit(IEnumerable<double> weights, int? seed = null)
        {
            foreach (var weight in weights)
            {
                if (weight < 0.0)
                {
                    throw new ArgumentException(string.Format("Weights must be positive. Found weight value: {0}", weight));
                }
            }

            // Fall back to a randomly generated seed when the caller does not supply one
            if (seed == null)
            {
                seed = new Random().Next();
            }

            return dataFrameProxy.RandomSplit(weights, seed.Value).Select(dfProxy => new DataFrame(dfProxy, sparkContext));
        }
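
A minimal usage sketch follows. It assumes an existing DataFrame named df (obtained, for example, through a Mobius SqlContext) and a using System.Linq; directive for ToList; the 70/30 weights and the seed value are purely illustrative, and only RandomSplit itself comes from the code above.

        // Split df into roughly 70% / 30% subsets; the weights are normalized
        // internally, so they do not need to sum exactly to 1.0.
        var splits = df.RandomSplit(new[] { 0.7, 0.3 }, seed: 42).ToList();
        DataFrame trainingData = splits[0];
        DataFrame testData = splits[1];

        // Omitting the seed (or passing null) makes the split non-deterministic,
        // because a fresh Random().Next() value is generated on each call.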