| cume_dist {SparkR} | R Documentation |
Window function: returns the cumulative distribution of values within a window partition, i.e. the fraction of rows in the partition whose value is less than or equal to that of the current row.
## S4 method for signature 'missing'
cume_dist()
cume_dist(x = "missing")
| x | empty. Should be used with no argument. |
N = total number of rows in the partition
cume_dist(x) = (number of values before (and including) x) / N
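The formula above can be checked on a small vector without a Spark session. The following is a minimal sketch in base R, not SparkR's implementation; the helper name `cume_dist_sketch` is hypothetical and assumes a single partition with no missing values.

```r
# Hypothetical helper (not part of SparkR): applies the formula above to one
# partition held in an ordinary R vector.
cume_dist_sketch <- function(x) {
  n <- length(x)                               # N = total rows in the partition
  sapply(x, function(v) sum(x <= v) / n)       # values before (and including) v, over N
}

hp <- c(10, 20, 20, 30)
cume_dist_sketch(hp)
# 0.25 0.75 0.75 1.00  (tied values share the same result)
```

Tied values receive the same result because each counts every row with an equal value as "before (and including)" itself.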
This is equivalent to the CUME_DIST function in SQL.
cume_dist since 1.6.0
Other window_funcs: dense_rank,
dense_rank,missing-method;
lag,
lag,characterOrColumn-method;
lead,
lead,characterOrColumn,numeric-method;
ntile,
ntile,numeric-method;
percent_rank,
percent_rank,missing-method;
rank,
rank,ANY-method,
rank,missing-method;
row_number,
row_number,missing-method
## Not run:
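# Convert the local mtcars data.frame to a SparkDataFrame
df <- createDataFrame(mtcars)
# Window: partition rows by `am`, order each partition by `hp`
ws <- orderBy(windowPartitionBy("am"), "hp")
# Cumulative distribution of `hp` within each `am` partition
out <- select(df, over(cume_dist(), ws), df$hp, df$am)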
## End(Not run)
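As a local cross-check of the example above, the same quantity can be approximated in base R on the `mtcars` data frame. This is a sketch under the assumption that the partition is `am` and the ordering column is `hp`; it uses `ave()` rather than any SparkR call, and the column name `cume_dist` on the result is chosen here for illustration.

```r
# Base-R approximation of the windowed example: for each `am` group, compute
# the fraction of rows whose `hp` is less than or equal to the current row's.
check <- mtcars[, c("hp", "am")]
check$cume_dist <- ave(
  check$hp, check$am,
  FUN = function(h) sapply(h, function(v) sum(h <= v) / length(h))
)

# Sort for easier reading: by partition, then by the ordering column
check <- check[order(check$am, check$hp), ]
head(check)
```

Within each `am` group the largest `hp` value gets a cumulative distribution of 1, which is a quick sanity check when comparing against the SparkR output.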