BDB Release Notes 9.5

Platform

New Features

Data Center

New Data Connector: Connect seamlessly with the new Apache Pinot data connector.
Enhanced support for Datasheet: Datastore and Report creation are possible now directly from the Datasheets.
Persistent Filter Values: Improved user experience by retaining the applied filters for quicker access to Data Sets.

Admin

Migration Restrictions: Prevents migrating Dashboards, API Services, and Pipelines within the same server and space for enhanced control.
Resource Observability: Introduced Resource Observability for the platform, providing enhanced visibility into resource usage and performance.

Entitlement Implementation: Access and manage control for shared items across the Data Center.

Enhancements

Data Center:

Datasheet List Page Updates:
1. Added a “Refresh Data” icon for Sandbox-based Datasheets.
2. Added an Information icon to get additional details about each datasheet.

Enhanced Execution Settings: Configure multiple node pools for optimized performance.
File Scanning: Implemented file scanning before uploading a file for improved platform security.
Data Store Scheduler Refresh: The default refresh interval for the Data Store scheduler is now set to weekly, with the refresh day aligned to the original creation day of the Data Store.
Notification Service Update: Enhanced the notification service for improved reliability and user experience.
Strengthened security measures to protect user data and privacy, addressing vulnerabilities found in a recent security audit.

Data Catalog Search

New Features

Columns Tab: The users get notified if a column contains personal information.
Delete Catalog Option: Provides a Delete icon for admin users to remove catalog entries.

Enhancements

Optimized Service Response: Enhanced response time for faster and smoother access to the Data Catalog, providing a more efficient experience for users.
Python Library Updates: Updated to the latest versions for improved stability, performance, and compatibility across the Platform.
History Tab: Displays last updated details, including recent activations and deactivations of pipeline and jobs.

Data Preparation

New Features

Introduced a new menu bar featuring transformation icons for faster access and smoother navigation, enhancing workflow efficiency.
Copy and Paste Transforms: Users can replicate transforms and apply them to another Data Preparation for the same data source.

Report

New Features

Report Creation: Reports can now be created based on data stores from the Apache Pinot data connector.
Chart Annotation: Plot annotation lines on the Line and Mixed charts to emphasize key events, enriching data storytelling and providing greater visual clarity.
Threshold Lines: Threshold lines have been introduced on the Line, Mixed, and Bar charts aiding in more precise data analysis.
New Charts:

Benchmark Analysis: Added a powerful Benchmark Analysis chart for in-depth performance evaluation and comparison.
Candlestick: Introduced the Candlestick chart, ideal for detailed financial data analysis and trend insights.

Enhancements

Optimized Report charts for accurate display in mobile browsers.
Word Cloud Chart Enhancement: Introduced support for displaying individual words or full sentences in the Word Cloud chart, with a new dataset type selection option for flexible visualization.

Designer

Enhancements

Introducing Annotations: Plot annotation lines on the Line, Mixed, and Timeseries charts to mark specific events, providing clearer insights and contextual analysis.
Threshold Lines & Fill Color: Ensure better data visibility to interpret and analyze data trends with the help of threshold lines and fill color functionality in the Bar, Line, Mixed, and Timeseries charts.
Benchmark Analysis Component: Combine Boxplot and Data Grid in one powerful visualization for a better understanding of the data benchmarks.
Slider for Bar Chart: Effortlessly manage large datasets by enabling the slider feature.
Entitlement Implementation: Introduced access control for shared dashboards and workspaces.

Enhancements

Improved Date Picker: Customizable fonts, colors, and adjustable year and month ranges for a seamless user experience.
Data Export with Formatters: Enhanced data export functionality to include formatters for Excel and CSV, ensuring consistent and structured data presentation.
Export to CSV: Introduced a new method for exporting chart data to CSV through a plugin script.
Sorting Enhancement for Data Grid: Supports sorting the date columns with the following types of date-time formats:

YYYY-MM-DD HH:MM:SS
MM/DD/YYYY HH:MM:SS
DD-MM-YYYY HH:MM:SS
ISO 8601 format (e.g., 2023-08-23T14:30:00Z)
Time formats such as 12-hour and 24-hour clocks (HH:MM)

Please Note:

The previous method used to export chart data to CSV has been deprecated as of this release.
Annotation mapping is currently supported for CSV and Excel data.

Data Science Lab

New Features

Consolidated Interface: The Export and Register options are now combined into a single interface for streamlined access.
Adjustable Repository Pane: Users can now adjust the width of the repository pane for better project organization.
Registered Model and API Details: Added “Created by” and “Parent Projects” details for each listed model or API.
Project-Level Node Pool Management: Added support for configuring multiple node pools at the project level, enabling more efficient resource allocation and scalability.
Feature Store Integration: Integrated the feature store as a data source for AutoML experiments.
Notebook Markdown Control: Introduced expand and collapse functionality for Notebook markdown cells.
Sandbox File Support for Multiple Excel Sheets: Upload Sandbox files with multiple Excel sheets in Notebooks, improving functionality across Data Preparation, Data Profile, Preview, and AutoML experiments, and streamlining data processing.
Entitlement Implementation: Access control for shared items across the Data Science Lab for Projects, Notebooks, and Models.
Library Status Tracking: Get enhanced visibility over operations by tracking library status updates for each installed library for a DSL Project.

Enhancement

Feature Store Enhancements: Added Data Preparation and scheduling capabilities to the Feature Store, improving data management and workflow efficiency.
Auto-Save Default: Auto-save is now enabled by default in notebooks.
Modification on Model as API: Registered Model APIs can predict the outputs via job.

Data Pipeline

New Features

New Pipeline Components:
1. PySpark GCS Reader and Writer Components: Enables reading from and writing to Google Cloud Storage using PySpark.
2. BigQuery Reader Compontnt: Facilates reading data from BigQuery.
3. PySpark Script Component: Allows execution of custom PySpark scripts.
Action-based Selection Buttons: Added selection buttons for filtering through pipeline and job lists based on actions and recent run status for enhancing navigation and usability of Data Pipelines and Jobs.
Recent Run: Displays job status for the top 5 recent job runs on the List Jobs page.
Filter Menu in System Pod Status Logs: Added a filter menu for better log management.
Side Menu Bar: Introduced a Side Menu Bar on various pages, including Pipeline Editor, monitor, and testing for improved navigation.
Expandable Graph for Pipeline Monitoring:Introduced an expandable graph to view detailed performance metrics and trends for enhanced monitoring and analysis.
Unique Component and Task Names: Ensure distinct names for components and tasks (when dragging).
Pin/Unpin Functionality for Pipelines and Jobs: Users can now pin or unpin pipelines and jobs for easier access and organization.
Recently Visited Pipelines and Jobs: The homepage displays the 10 most recently visited pipelines and jobs providing quick access to your work.
Spark Upgrade: Upgraded Spark version to 3.5.1 for improved performance.
Java Upgrade: Upgraded Java to 17 for enhanced functionality and security.
Clear Job Logs: The Clear option helps the users clear job logs from the Sandbox location.
A confirmation pop-up window is added for unsaved or newly configured components/ tasks to prevent accidental loss of changes.

Enhancements

Kafka Event Preview:
1. Improved the preview list for pipeline Kafka events to provide more detailed and actionable information.
2. Kafka Preview records can be expanded from 10 to 100 records with filters for the latest, beginning, and customizable timestamp values.
System Pod Logs: All system pod logs are available on the System Component Status page for comprehensive monitoring and diagnostics.
The History Panel now displays details on the creation, update, last activation, and last deactivation of pipelines and jobs.
The Component Version Button remains disabled when no new version is available for the pipeline components.

Product Release Notes

Platform

Data Catalog Search

Data Preparation

Report

Designer

Data Science Lab

Data Pipeline

Connect with BDB Expert