Simplifies a dataset by retaining only specified columns, removing all others. Creates a focused view with only the fields needed for downstream analysis.
When to Use
Reduce dataset complexity for better performance
Remove unnecessary or sensitive columns
Prepare data for specific analysis needs
Create lightweight extracts for reporting
Focus on key variables for visualization
Simplify datasets before exporting or sharing
How It Works
Takes a list of columns to retain
Filters the dataset to include only the specified columns
Updates the PDV (Physical Data View) metadata to reflect only the kept columns
Preserves the row count while reducing the column count
Tracks metadata about the column reduction process
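The steps above can be sketched in plain Python. This is only an illustration of the described behavior, not the tool's actual implementation: the names Dataset, keep_columns, and column_metadata are hypothetical, the PDV is assumed to be representable as a simple per-column dictionary, and raising an error for unknown columns is an assumption the source does not state.

```python
from dataclasses import dataclass, field


@dataclass
class Dataset:
    """Hypothetical stand-in for a dataset: rows plus per-column metadata."""
    rows: list[dict]
    # Assumed shape of the PDV metadata: column name -> descriptive attributes.
    column_metadata: dict[str, dict] = field(default_factory=dict)


def keep_columns(dataset: Dataset, columns: list[str]) -> tuple[Dataset, dict]:
    """Return a reduced dataset and a summary of the column reduction."""
    # Collect known column names in first-seen order (metadata first, then rows).
    row_names = (name for row in dataset.rows for name in row)
    existing = list(dict.fromkeys([*dataset.column_metadata, *row_names]))

    # Assumption: requesting a column that does not exist is an error.
    missing = [name for name in columns if name not in existing]
    if missing:
        raise KeyError(f"Columns not found: {missing}")

    # Keep only the requested columns; every row is retained.
    kept_rows = [{name: row.get(name) for name in columns} for row in dataset.rows]

    # Update the metadata so it describes only the kept columns.
    kept_metadata = {
        name: meta
        for name, meta in dataset.column_metadata.items()
        if name in columns
    }

    # Track what the reduction did.
    summary = {
        "kept": list(columns),
        "dropped": [name for name in existing if name not in columns],
        "row_count": len(kept_rows),
    }
    return Dataset(kept_rows, kept_metadata), summary


# Usage: drop a sensitive column before sharing.
people = Dataset(
    rows=[
        {"id": 1, "name": "Ada", "ssn": "000-00-0001"},
        {"id": 2, "name": "Bob", "ssn": "000-00-0002"},
    ],
    column_metadata={
        "id": {"type": "int"},
        "name": {"type": "str"},
        "ssn": {"type": "str"},
    },
)
reduced, summary = keep_columns(people, ["id", "name"])
print(summary)
```

The sketch returns a new object rather than modifying the input, so the original dataset stays available if the reduced view turns out to be missing a needed field.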
Parameters
Required
columns - Specifies which columns to keep, using either: