Data Preparation
New Features
- New Transforms: Added the following transformations to the Data Preparation module.
  - ML Transforms
    - Convert value to column (One Hot encoding)
    - Discretize values
    - Expanding window transform
    - Feature agglomeration
    - encoding
    - Label encoding
    - Lag Transformation
    - Leave one out encoding
    - Principal component analysis
    - Rolling data
    - Singular Value decomposition
    - Target-based quantile encoding
    - Target encoding
    - Weight of evidence encoding
  - Functions Transforms
    - Formula Based Transformation
- Message Snackbars: Added a close option to the error and success message snackbars so users can dismiss them.
- Data Set Level Suggestions: Data set level suggestions are now shown by default when the Data Preparation page opens, and the selected column is highlighted.
- Auto Preparation: The Auto-Prep button is disabled after it has been applied once to the selected dataset.
- Data Profile: A consolidated Data Profile tab now displays Profile, Chart, Suggestion, and Pattern.
- Data Preparation Naming: Users name a Data Preparation to save the actions performed on the selected dataset; an auto-generated name is assigned if no name is provided.
- Profiling Chart: Integrated the profiling chart into the menu bar.
- Column Menu: The column menu now opens from a drop-down icon instead of a three-bar menu icon.
- Angular Version Update: Upgraded Angular version from 7 to 14.
- Transform Drawer UI: Implemented a drawer UI for transforms.
- Settings: Introduced the Settings icon, which opens a drawer containing the SKIP and Total Rows options. The Settings icon is disabled for a new Data Preparation based on a Data Set.
- Confirmation Drawer UI: A confirmation drawer is provided under the Steps tab for all transforms that support the configuration dialog box.
- Column Rename Option: Users can now rename a column by clicking its drop-down menu icon and selecting the Rename Column transform from the context menu.
- Column Profiling: Improved efficiency for large-scale column changes, reducing processing time.
- Existing Transforms: Replaced the Create new Columns checkbox with the New Column Name field for new transforms.
- Total Rows: Each page now displays up to 200 rows of data. Up to 5 pages can be shown at once, for a total of up to 1000 rows.
- Source and Sample Size Information: Included source and sample size information in the panel below.
- Save Icon: Introduced a Save icon for saving a Data Preparation. The applied Data Preparation steps are auto-saved each time.
- Close Icon: Replaced the Back icon with the Close icon in the Data Preparation workspace. The Close icon closes the workspace and returns to the Data Preparation list.
- Data Set Info Icon: A newly created Data Preparation based on a Data Set displays an enabled Information icon. Once the Preparation is saved, the Information icon is removed from the Data Preparation menu bar.
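
For readers unfamiliar with some of the ML transforms listed under New Transforms, the sketch below shows the conventional textbook meaning of a few of them in plain Python. It is illustrative only: the function names, parameters, and edge-case handling (for example, padding with None) are assumptions made for this example and do not describe how the Data Preparation module implements or names these transforms.

```python
from collections import defaultdict


def one_hot(values):
    """Convert value to column: one 0/1 column per distinct value."""
    categories = sorted(set(values))
    return {c: [1 if v == c else 0 for v in values] for c in categories}


def label_encode(values):
    """Label encoding: map each distinct value to an integer code."""
    codes = {c: i for i, c in enumerate(sorted(set(values)))}
    return [codes[v] for v in values]


def lag(values, periods=1):
    """Lag transformation: shift values down by `periods`, padding with None."""
    shifted = [None] * periods + list(values)
    return shifted[:len(values)]


def rolling_mean(values, window):
    """Rolling data: mean of the last `window` values; None until the window fills."""
    out = []
    for i in range(len(values)):
        if i + 1 < window:
            out.append(None)
        else:
            chunk = values[i + 1 - window:i + 1]
            out.append(sum(chunk) / window)
    return out


def expanding_mean(values):
    """Expanding window: mean of all values seen so far."""
    out, total = [], 0.0
    for count, v in enumerate(values, start=1):
        total += v
        out.append(total / count)
    return out


def target_encode(categories, targets):
    """Target encoding: replace each category with its mean target value."""
    sums, counts = defaultdict(float), defaultdict(int)
    for c, t in zip(categories, targets):
        sums[c] += t
        counts[c] += 1
    return [sums[c] / counts[c] for c in categories]


if __name__ == "__main__":
    colors = ["red", "blue", "red"]
    print(one_hot(colors))
    print(label_encode(colors))
    print(lag([10, 20, 30]))
    print(rolling_mean([1, 2, 3, 4], window=2))
    print(expanding_mean([1, 2, 3]))
    print(target_encode(["a", "b", "a"], [1, 0, 0]))
```

The lag and rolling examples leave the first rows empty because no earlier values exist for them; how the module fills such rows is not stated in these notes.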
Enhancements
- UI updates for Suggestions:
  - The Suggestions section is now placed correctly and shows the name of the selected column.
  - The Apply button is enabled only after a suggestion is selected.
  - Suggestions for the selected column are highlighted.
Please Note: Column-level suggestions are displayed based on the column selected in the dataset. The Suggestions section no longer supports generic suggestions covering all columns in the dataset.