Round decimal numbers in one or several columns using round, floor, or ceil.
Sample rows from a DataFrame or Arrow Table using various sampling methods.
Sort rows in a DataFrame or Arrow Table by one or more columns.
Split a text column into several columns at each occurrence of a delimiter. Output column names consist of a prefix followed by a number.
Combine rows of two or more datasets into a single output dataset using union, intersection, or custom schema methods.
Filter a dataset to keep the top and/or bottom N rows, ranked by sorting on the specified column(s).
Transpose multiple rows into columns (widening the dataset) using an Index column, a Label column, and a Value column.
Transform a column's values into binary columns (one-hot encoding/dummification).
Perform window operations on a dataset with configurable frames and ordering.
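As a rough illustration of what the operations listed above do, here is a minimal pandas sketch. It is not the implementation behind these components. The column names (`id`, `group`, `price`, `tags`), the `-` delimiter, the `tags_` prefix, and the sample values are all assumptions made for the example.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data; names and values are illustrative only.
df = pd.DataFrame({
    "id": [1, 2, 3, 4],
    "group": ["a", "a", "b", "b"],
    "price": [1.234, 2.567, 3.891, 4.125],
    "tags": ["x-y", "p-q", "m-n", "u-v"],
})

# Round: round, floor, or ceil a decimal column.
rounded = df["price"].round(1)
floored = np.floor(df["price"])
ceiled = np.ceil(df["price"])

# Sample: draw a fixed number of rows (seeded for reproducibility).
sampled = df.sample(n=2, random_state=0)

# Sort: order rows by one or more columns.
sorted_df = df.sort_values(["group", "price"], ascending=[True, False])

# Split: break a text column on a delimiter into prefixed, numbered columns.
parts = df["tags"].str.split("-", expand=True)
parts.columns = [f"tags_{i}" for i in range(parts.shape[1])]

# Stack (union): append the rows of two datasets with the same schema.
stacked = pd.concat([df, df], ignore_index=True)

# Top N: keep the N rows with the largest values in a column.
top2 = df.nlargest(2, "price")

# Pivot: widen a long table using index, label, and value columns.
long_df = pd.DataFrame({
    "index": [1, 1, 2, 2],
    "label": ["h", "w", "h", "w"],
    "value": [10, 20, 30, 40],
})
wide = long_df.pivot(index="index", columns="label", values="value")

# One-hot encoding: one binary column per distinct value.
dummies = pd.get_dummies(df["group"], prefix="group")

# Window: running total of price within each group, ordered by price.
running = df.sort_values("price").groupby("group")["price"].cumsum()
```

Each statement is independent of the others, so any one of them can be lifted out and adapted to a real schema.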