Are there any functions like discretization, normalization and data transformation (categorical to numeric) on Google Cloud Dataprep?
2 Answers
The documentation has a list of normalization techniques among which Trim whispace, Standardize units, etc. From the docs:
Trim whitespace
You can trim out whitespace from an individual column via transform. The TRIM function applied to string values removes the leading and trailing whitespace:
set col: myCol value: TRIM(myCol)
Also there is a list of transform operations classified by category: Initial parsing, Manage columns, etc. For example, in the Aggregate Transform doc:
...
Basic Usage
aggregate value:MAX(totalSales) group: Region
Output: Reshapes the dataset to two columns: The new Region column contains the unique values from the source Region column, and the max_totalSales column contains the maximum value of the values in the original totalSales column for each value in Region.
......
See the the complete list of transforms for more details.
![](../../users/profiles/-1.webp)
- 1
- 1
![](../../users/profiles/2650325.webp)
- 1,945
- 2
- 12
- 20