Databricks Features
Reports (5)
Reports Interface
Reports interface for standard and self-service reports is intuitive and easy to use.
Steps to Answer
Requires a minimal number of steps/clicks to answer business question.
Graphs and Charts
Offers a variety of attractive graph and chart formats.
Score Cards
Score cards visually track KPI's.
Dashboards
Provides business users an interface to easily design, refine and collaborate on their dashboards
Self Service (6)
Calculated Fields
Using formulas based on existing data elements, users can create and calculate new field values.
Data Column Filtering
Business users have the ability to filter data in a report based on predefined or automodeled parameters.
Data Discovery
Users can drill down and explore data to discover new insights.
Search
Ability to search global data set to find and discover data.
Collaboration / Workflow
Ability for users to share data and reports they have built within the BI tool and outside the tool through other collaboration platforms.
Automodeling
Tool automatically suggests data types, schemas and hierarchies.
Advanced Analytics (3)
Predictive Analytics
Analyze current and historical trends to make predictions about future events.
Data Visualization
Communicate complex information clearly and effectively through advanced graphical techniques.
Big Data Services
Ability to handle large, complex, and/or siloed data sets.
Building Reports (4)
Data Transformation
Converts data formats of source data into the format required for the reporting system without mistakes.
Data Modeling
Ability to (re)structure data in a manner that allows extracting insights fast and accurate.
WYSIWYG Report Design
Provides business users an interface to easily design and refine their dashboards and reports. (What You See Is What You Get)
Integration APIs
Application Programming Interface - Specification for how the application communicates with other software. API's typically enable integration of data, logic, objects, etc with other software applications.
Model Development (5)
Language Support
Supports programming languages such as Java, C, or Python. Supports front-end languages such as HTML, CSS, and JavaScript
Drag and Drop
Offers the ability for developers to drag and drop pieces of code or algorithms when building models
Pre-Built Algorithms
Provides users with pre-built algorithms for simpler model development
Model Training
Supplies large data sets for training individual models
Feature Engineering
Transforms raw data into features that better represent the underlying problem to the predictive models
Machine/Deep Learning Services (6)
Computer Vision
Offers image recognition services
Natural Language Processing
Offers natural language processing services
Natural Language Generation
Offers natural language generation services
Artificial Neural Networks
Offers artificial neural networks for users
Natural Language Understanding
Offers natural language understanding services
Deep Learning
Provides deep learning capabilities
Deployment (15)
Managed Service
Manages the intelligent application for the user, reducing the need of infrastructure
Application
Allows users to insert machine learning into operating applications
Scalability
Provides easily scaled machine learning applications and infrastructure
Language Flexibility
Allows users to input models built in a variety of languages.
Framework Flexibility
Allows users to choose the framework or workbench of their preference.
Versioning
Records versioning as models are iterated upon.
Ease of Deployment
Provides a way to quickly and efficiently deploy machine learning models.
Scalability
Offers a way to scale the use of machine learning models across an enterprise.
On-Premise
Provides On-Premise deployment options.
Cloud
Provides Cloud deployment options (private or public cloud, hybrid cloud).
Language Flexibility
Allows users to input models built in a variety of languages.
Framework Flexibility
Allows users to choose the framework or workbench of their preference.
Versioning
Records versioning as models are iterated upon.
Ease of Deployment
Provides a way to quickly and efficiently deploy machine learning models.
Scalability
Offers a way to scale the use of machine learning models across an enterprise.
Database (3)
Real-Time Data Collection
Collects, stores, and organizes massive, unstructured data in real time
Data Distribution
Facilitates the disseminating of collected big data throughout parallel computing clusters
Data Lake
Creates a repository to collect and store raw data from sensors, devices, machines, files, etc.
Integrations (2)
Hadoop Integration
Aligns processing and distribution workflows on top of Apache Hadoop
Spark Integration
Aligns processing and distribution workflows on top of Apache Hadoop
Platform (3)
Machine Scaling
Facilitates solution to run on and scale to a large number of machines and systems
Data Preparation
Curates collected data for big data analytics solutions to analyze, manipulate, and model
Spark Integration
Aligns processing and distribution workflows on top of Apache Hadoop
Processing (2)
Cloud Processing
Moves big data collection and processing to the cloud
Workload Processing
Processes batch, real-time, and streaming data workloads in singular, multi-tenant, or cloud systems
Data Transformation (2)
Real-Time Analytics
Facilitates analysis of high-volume, real-time data.
Data Querying
Allows user to query data through query languages like SQL.
Connectivity (4)
Hadoop Integration
Aligns processing and distribution workflows on top of Apache Hadoop
Spark Integration
Aligns processing and distribution workflows on top of Apache Spark
Multi-Source Analysis
Integrates data from multiple external databases.
Data Lake
Facilitates the dissemination of collected big data throughout parallel computing clusters.
Operations (8)
Data Visualization
Processes data and represents interpretations in a variety of graphic formats.
Data Workflow
Strings together specific functions and datasets to automate analytics iterations.
Governed Discovery
Isolates certain datasets and facilitates management of data access.
Embedded Analytics
Allows big data tool to run and record data within external applications.
Notebooks
Use notebooks for tasks such as creating dashboards with predefined, scheduled queries and visualizations
Metrics
Control model usage and performance in production
Infrastructure management
Deploy mission-critical ML applications where and when you need them
Collaboration
Easily compare experiments—code, hyperparameters, metrics, predictions, dependencies, system metrics, and more—to understand differences in model performance.
Administration (4)
Data Modelling
Tools to (re)structure data in a manner that allows extracting insights quickly and accurately
Recommendations
Analyzes data to find and recommend the highest value customer segmentations.
Workflow Management
Tools to create and adjust workflows to ensure consistency.
Dashboards and Visualizations
Presents information and analytics in a digestible, intuitive, and visually appealing way.
Compliance (4)
Sensitive Data Compliance
Supports compliance with PII, GDPR, HIPPA, PCI, and other regulatory standards.
Training and Guidelines
Provides guidelines or training related to sensitive data compliance requirements,
Policy Enforcement
Allows administrators to set policies for security and data governance
Compliance Monitoring
Monitors data quality and send alerts based on violations or misuse
Data Quality (3)
Data Preparation
Curates collected data for big data analytics solutions to analyze, manipulate, and model
Data Distribution
Facilitates the disseminating of collected big data throughout parallel computing clusters
Data Unification
Compile data from across all systems so that users can view relevant information easily.
Management (19)
Reporting
View ETL process data via reports and visualizations like charts and graphs.
Auditing
Record ETL historical data for auditing and potential data correction needs.
Cataloging
Records and organizes all machine learning models that have been deployed across the business.
Monitoring
Tracks the performance and accuracy of machine learning models.
Governing
Provisions users based on authorization to both deploy and iterate upon machine learning models.
Model Registry
Allows users to manage model artifacts and tracks which models are deployed in production.
Data dictionary
Stores the database metadata, that is the definitions of data elements, types, relationships etc.
Data Replication
Creates a copy of the database to maintain consistency and integrity.
Query Language
Allows users to create, update and retrieve data in a database.
Data Modeling
Defines the logical design of the data before building the schemas.
Performance Analysis
Monitors and analyzes critical database attributes like query performance, user sessions, dead lock detail, system errors etc and visualize them on a custom dashboard.
Business Glossary
Lets users build a glossary of business terms, vocabulary and definitions across multiple tools.
Data Discovery
Provides a built-in integrated data catalog that allows users to easily locate data across multiple sources.
Data Profililng
Monitors and cleanses data with the help of business rules and analytical algorithms.
Reporting and Visualization
Visualize data flows and lineage that demonstrates compliance with reports and dashboards through a single console.
Data Lineage
Provides an automated data lineage functionality which provides visibility over the entire data movement journey from data origination to destination.
Cataloging
Records and organizes all machine learning models that have been deployed across the business.
Monitoring
Tracks the performance and accuracy of machine learning models.
Governing
Provisions users based on authorization to both deploy and iterate upon machine learning models.
Functionality (5)
Extraction
Extract data from the designated source(s) like relational databases, JSON files, and XML files.
Transformation
Cleanse and re-format extracted data to the needed target format.
Loading
Load reformatted data into target database, data warehouse, or other storage location.
Automation
Arrange ETL processes to occur automatically on needed time schedule (e.g., daily, weekly, monthly).
Scalability
Capable of scaling processing power up or down based on ETL volume.
System (1)
Data Ingestion & Wrangling
Gives user ability to import a variety of data sources for immediate use
Data Preparation (2)
Connectors
Ability to connect the analytics platform with a wide range of connector options for common data sources, including popular enterprise applications.
Data Governance
Connects to enterprise data governance software, or provides integrated data governance features to avoid misuse of data
Data Modeling and Blending (3)
Data Querying
Using formulas based on existing data elements, users can create and calculate new field values
Data Filtering
Business users have the ability to filter data in a report based on predefined or automodeled parameters.
Data Blending
Allows the user to combine data from multiple sources into a functioning dataset.
Data Management (10)
Data Integration
Consolidates, Cleanses and Normalizes data from multiple disparate sources.
Data Compression
Helps save storage capacity and improves query performance.
Data Quality
Eliminates data inconsistency and duplications ensuring data integrity.
Built-In Data Analytics
SQL based analytics functions like Time series, pattern matching, geospatial analytics etc.
In-Database Machine Learning
Provides built in capabilities like machine learning algorithms, data preparation functions, model evaluation and management etc.
Data Lake Analytics
Allows data querying across data formats like parquet, ORC, JSON etc and analyze complex data types on HDFS
Data Integration
Integrates data and data-related technologies into a single environment.
Metadata
Provides metadata management capabilities.
Self-service
Empowers the user via a self-service capability to manage data workflows.
Automated workflows
Completely automates end-to-end data workflows across the data integration lifecycle.
Integration (3)
AI/ ML Integration
Integrates with data science workflows, Machine Learning and artificial intelligence (AI) capabilities.
BI Tool Integration
Integrates with BI Tools to transform data into Actionable Insights.
Data lake Integration
Provides speed in data processing and capturing unstructured, semi-structured and streaming data.
Performance (1)
Scalability
Manages huge volumes of data, upscale or downscale as per demand.
Maintenance (3)
Data Migration
Allows data movement from one database to another.
Backup and Recovery
Provides data backup and recovery functionality to protect and restore a database.
Multi-User Environment
Allows users to access and work on data concurrently, supporting several views of the data.
Security (7)
Data Encryption
Encrypts and transforms data at the database from a readable state into a ciphertext of unreadable characters.
User Access Control
Allows restricted user acess to modify depending on the access level.
Data Governance
Policies, procedures and standards to manage and access data.
Data Security
Restricts data access at a cell level, mask or hide parts of cells, and encrypt data at rest and in motion
Access Control
Authenticates and authorizes individuals to access the data they are allowed to see and use.
Roles Management
Helps identify and manage the roles of owners and stewards of data.
Compliance Management
Helps adhere to data privacy regulations and norms.
Maintainence (2)
Data Quality Management
Defines, validates, and monitors business rules to safeguard master data readiness.
Policy Management
Allows users to create and review data policies to make them consistent across the organization.
Analytics (2)
Analytics capabilities
Provides a high performance, flexibile analytics platform to support data management and embrace data driven decision making.
Dasboard visualizations
Collect and displays metrics across the data integration via a dashboard.
Monitoring and Management (2)
Data Observability
Involved solely in monitoring data pipelines, sending alerts and troubleshooting data.
Testing capabilities
Deploys testing capabilities such as report testing, big data testing, cloud data migration testing, ETL and data warehouse testing.
Cloud Deployment (2)
Hybrid cloud support
Supports analytical platforms and data pipelines across complex hybrid environments.
Cloud migration capabilities
Supports migration of component or pipeline to different cloud environments.
Generative AI (13)
AI Text Generation
Allows users to generate text based on a text prompt.
AI Text Summarization
Condenses long documents or text into a brief summary.
AI Text Generation
Allows users to generate text based on a text prompt.
AI Text Summarization
Condenses long documents or text into a brief summary.
AI Text Generation
Allows users to generate text based on a text prompt.
AI Text Summarization
Condenses long documents or text into a brief summary.
AI Text Generation
Allows users to generate text based on a text prompt.
AI Text Summarization
Condenses long documents or text into a brief summary.
AI Text-to-Image
Provides the ability to generate images from a text prompt.
AI Text Generation
Allows users to generate text based on a text prompt.
AI Text Summarization
Condenses long documents or text into a brief summary.
AI Text Generation
Allows users to generate text based on a text prompt.
AI Text Summarization
Condenses long documents or text into a brief summary.
Scalability and Performance - Generative AI Infrastructure (3)
AI High Availability
Ensures that the service is reliable and available when needed, minimizing downtime and service interruptions.
AI Model Training Scalability
Allows the user to scale the training of models efficiently, making it easier to deal with larger datasets and more complex models.
AI Inference Speed
Provides the user the ability to get quick and low-latency responses during the inference stage, which is critical for real-time applications.
Cost and Efficiency - Generative AI Infrastructure (3)
AI Cost per API Call
Offers the user a transparent pricing model for API calls, enabling better budget planning and cost control.
AI Resource Allocation Flexibility
Provides the user the ability to allocate computational resources based on demand, making it cost-effective.
AI Energy Efficiency
Allows the user to minimize energy usage during both training and inference, which is becoming increasingly important for sustainable operations.
Integration and Extensibility - Generative AI Infrastructure (3)
AI Multi-cloud Support
Offers the user the flexibility to deploy across multiple cloud providers, reducing the risk of vendor lock-in.
AI Data Pipeline Integration
Provides the user the ability to seamlessly connect with various data sources and pipelines, simplifying data ingestion and pre-processing.
AI API Support and Flexibility
Allows the user to easily integrate the generative AI models into existing workflows and systems via APIs.
Security and Compliance - Generative AI Infrastructure (3)
AI GDPR and Regulatory Compliance
Helps the user maintain compliance with GDPR and other data protection regulations, which is crucial for businesses operating globally.
AI Role-based Access Control
Allows the user to set up access controls based on roles within the organization, enhancing security.
AI Data Encryption
Ensures that data is encrypted during transit and at rest, providing an additional layer of security.
Usability and Support - Generative AI Infrastructure (2)
AI Documentation Quality
Provides the user with comprehensive and clear documentation, aiding in quicker adoption and troubleshooting.
AI Community Activity
Allows the user to gauge the level of community support and third-party extensions available, which can be useful for problem-solving and extending functionality.
Customization - AI Agent Builders (3)
Natural Language Configuration
Supports configuration using natural language instructions.
Tone Customization
Allows users to customize the tone of agent.
Security Guardrails
Enables definition of clear security guardrails for agent actions.
Functionality - AI Agent Builders (4)
Omni-channel Support
Provides support across web, mobile, messaging apps, and other channels.
Agent Branding
Allows customization of agent branding, including visual appearance and conversational style.
Proactive Response Capabilities
Equips agents with proactive response capabilities based on predefined triggers.
Seamless Human Escalation
Facilitates seamless escalation to human employees for complex issues.
Data and Analytics - AI Agent Builders (3)
Analytics & Reporting
Provides analytics and reporting on agent performance and interactions.
Contextual Awareness
Offers agents the ability to maintain contextual awareness across interactions.
Data Privacy Compliance
Ensures compliance with data privacy and governance requirements.
Integration - AI Agent Builders (4)
Workflow Automation
Automates workflows and actions based on agent responses.
API Usage
Allows the use of APIs for advanced agent configuration.
Platform Interoperability
Enables interoperability with multiple platforms for unified experiences.
CRM Data Integration
Allows integration with CRM data to ground agent responses in business context.
Agentic AI - Data Governance (6)
Autonomous Task Execution
Capability to perform complex tasks without constant human input
Multi-step Planning
Ability to break down and plan multi-step processes
Cross-system Integration
Works across multiple software systems or databases
Adaptive Learning
Improves performance based on feedback and experience
Natural Language Interaction
Engages in human-like conversation for task delegation
Decision Making
Makes informed choices based on available data and objectives
Agentic AI - DataOps Platforms (5)
Autonomous Task Execution
Capability to perform complex tasks without constant human input
Multi-step Planning
Ability to break down and plan multi-step processes
Cross-system Integration
Works across multiple software systems or databases
Adaptive Learning
Improves performance based on feedback and experience
Decision Making
Makes informed choices based on available data and objectives
Agentic AI - Analytics Platforms (7)
Autonomous Task Execution
Capability to perform complex tasks without constant human input
Multi-step Planning
Ability to break down and plan multi-step processes
Cross-system Integration
Works across multiple software systems or databases
Adaptive Learning
Improves performance based on feedback and experience
Natural Language Interaction
Engages in human-like conversation for task delegation
Proactive Assistance
Anticipates needs and offers suggestions without prompting
Decision Making
Makes informed choices based on available data and objectives
Agentic AI - Data Science and Machine Learning Platforms (7)
Autonomous Task Execution
Capability to perform complex tasks without constant human input
Multi-step Planning
Ability to break down and plan multi-step processes
Cross-system Integration
Works across multiple software systems or databases
Adaptive Learning
Improves performance based on feedback and experience
Natural Language Interaction
Engages in human-like conversation for task delegation
Proactive Assistance
Anticipates needs and offers suggestions without prompting
Decision Making
Makes informed choices based on available data and objectives
Deployment & Integration - Analytics Platforms (4)
No-code Dashboard Builder
Enables non-technical users to build dashboards through intuitive, drag-and-drop interfaces
Report Scheduling and Automation
Enables automated report generation and scheduled delivery to stakeholders
Embedded Analytics and White-labeling
Allows dashboards and analytics to be embedded into external apps with branding flexibility
Data Source Connectivity
Supports integration with major data sources like cloud data warehouses, SQL/NoSQL databases, and SaaS applications
Performance & Scalability - Analytics Platforms (2)
Large data handling and Query Speed
Efficiently processes large datasets with minimal lag and ensures high performance under load
Concurrent User Support
Maintains performance and uptime during high traffic from multiple users or teams
Advanced Analytics & Modeling - Analytics Platforms (3)
Data Modeling and Governance
Supports semantic data layers, role-based access controls, and metadata governance
Notebook and Script Integration
Integrates with Jupyter, Python, or R for custom analytics and modeling
Built-in Predictive and Statistical Models
Provides native tools for statistical analysis, forecasting, and trend prediction
Agentic AI Capabilities - Analytics Platforms (4)
Auto-generated Insights and Narratives
Uses AI to generate textual summaries, key takeaways, and data stories from dashboards
Natural Language Queries
Allows users to query data and build reports using conversational or plain language
Proactive KPI Monitoring and Alerts
Detects and notifies users about KPI anomalies or significant metric changes in real time
AI Agents for Analytical Follow-ups
Recommends next questions, analyses, or exploration paths using autonomous AI agents
Personalized Intelligence - Analytics Platforms (3)
Behavioral Learning for Contextual Query Refinement
Learns from historical user interactions to improve and personalize query results over time
Role-based Insight Personalization
Tailors dashboard views and suggestions based on user roles, access levels, and past behavior
Conversational and Prompt-based Analytics
Supports AI-driven exploration via prompts or multi-turn conversations for iterative querying
Traffic Management & Performance - AI Gateways (3)
Token-Aware Rate Limiting
Enforces strict request and token quotas per user, tenant, or application to prevent system overloads and manage API budget limits.
Semantic Caching
Caches responses for identical or semantically similar prompts to significantly reduce API latency and eliminate redundant token costs.
Multi-Model Routing & Fallbacks
Routes API requests to various LLM providers via a single unified API and automatically fails over to backup models if downtime occurs.
Governance & Observability - AI Gateways (3)
Data Privacy
Intercepts traffic to detect and mask Personally Identifiable Information (PII) or block malicious prompt injections before the request reaches the external model.
Cost Tracking
Provides granular dashboards to monitor token consumption and calculate exact monetary costs broken down by application, team, or specific model.
Centralized API Key Security
Securely vaults and manages external LLM API keys within the gateway so developers do not need to hardcode credentials into their applications.





