Creates upper and/or lower bounds in numerical columns by clipping or clearing outliers.
Group data by specified columns and apply aggregation functions (distinct, max, min, avg, median, sum, std, count, first, last, concat).
Impute missing values in numerical columns with mean, median, or mode.
Enrich one dataset with columns from another using various join types (inner, left, right, full outer, cross).
Keep only specified columns in a DataFrame or Arrow Table.
Move one or more columns to a specific position in a DataFrame or Arrow Table.
Transform boolean values to their negation (TRUE to FALSE, FALSE to TRUE).
Combine every pair of numerical columns with standard + - × ÷ operations.
Reshape a dataset from long to wide format by pivoting a column and aggregating values.
Read data from files supporting multiple standard formats (parquet, json, csv, excel).
Remove rows where specified cells are empty/null.
Rename columns in a DataFrame or Arrow Table.
Info
We use our own cookies as well as third-party cookies on our websites to enhance your experience, analyze our traffic, and for security and marketing.