Snowflake Features
Model Development (5)
Language Support
Supports programming languages such as Java, C, or Python. Supports front-end languages such as HTML, CSS, and JavaScript
Drag and Drop
Offers the ability for developers to drag and drop pieces of code or algorithms when building models
Pre-Built Algorithms
Provides users with pre-built algorithms for simpler model development
Model Training
Supplies large data sets for training individual models
Feature Engineering
Transforms raw data into features that better represent the underlying problem to the predictive models
Machine/Deep Learning Services (6)
Computer Vision
Offers image recognition services
Natural Language Processing
Offers natural language processing services
Natural Language Generation
Offers natural language generation services
Artificial Neural Networks
Offers artificial neural networks for users
Natural Language Understanding
Offers natural language understanding services
Deep Learning
Provides deep learning capabilities
Deployment (15)
Managed Service
Manages the intelligent application for the user, reducing the need of infrastructure
Application
Allows users to insert machine learning into operating applications
Scalability
Provides easily scaled machine learning applications and infrastructure
Language Flexibility
Allows users to input models built in a variety of languages.
Framework Flexibility
Allows users to choose the framework or workbench of their preference.
Versioning
Records versioning as models are iterated upon.
Ease of Deployment
Provides a way to quickly and efficiently deploy machine learning models.
Scalability
Offers a way to scale the use of machine learning models across an enterprise.
On-Premise
Provides On-Premise deployment options.
Cloud
Provides Cloud deployment options (private or public cloud, hybrid cloud).
Language Flexibility
Allows users to input models built in a variety of languages.
Framework Flexibility
Allows users to choose the framework or workbench of their preference.
Versioning
Records versioning as models are iterated upon.
Ease of Deployment
Provides a way to quickly and efficiently deploy machine learning models.
Scalability
Offers a way to scale the use of machine learning models across an enterprise.
Database (3)
Real-Time Data Collection
Collects, stores, and organizes massive, unstructured data in real time
Data Distribution
Facilitates the disseminating of collected big data throughout parallel computing clusters
Data Lake
Creates a repository to collect and store raw data from sensors, devices, machines, files, etc.
Integrations (2)
Hadoop Integration
Aligns processing and distribution workflows on top of Apache Hadoop
Spark Integration
Aligns processing and distribution workflows on top of Apache Hadoop
Platform (3)
Machine Scaling
Facilitates solution to run on and scale to a large number of machines and systems
Data Preparation
Curates collected data for big data analytics solutions to analyze, manipulate, and model
Spark Integration
Aligns processing and distribution workflows on top of Apache Hadoop
Processing (2)
Cloud Processing
Moves big data collection and processing to the cloud
Workload Processing
Processes batch, real-time, and streaming data workloads in singular, multi-tenant, or cloud systems
Data Transformation (2)
Real-Time Analytics
Facilitates analysis of high-volume, real-time data.
Data Querying
Allows user to query data through query languages like SQL.
Connectivity (4)
Hadoop Integration
Aligns processing and distribution workflows on top of Apache Hadoop
Spark Integration
Aligns processing and distribution workflows on top of Apache Spark
Multi-Source Analysis
Integrates data from multiple external databases.
Data Lake
Facilitates the dissemination of collected big data throughout parallel computing clusters.
Operations (8)
Data Visualization
Processes data and represents interpretations in a variety of graphic formats.
Data Workflow
Strings together specific functions and datasets to automate analytics iterations.
Governed Discovery
Isolates certain datasets and facilitates management of data access.
Embedded Analytics
Allows big data tool to run and record data within external applications.
Notebooks
Use notebooks for tasks such as creating dashboards with predefined, scheduled queries and visualizations
Metrics
Control model usage and performance in production
Infrastructure management
Deploy mission-critical ML applications where and when you need them
Collaboration
Easily compare experiments—code, hyperparameters, metrics, predictions, dependencies, system metrics, and more—to understand differences in model performance.
Administration (4)
Data Modelling
Tools to (re)structure data in a manner that allows extracting insights quickly and accurately
Recommendations
Analyzes data to find and recommend the highest value customer segmentations.
Workflow Management
Tools to create and adjust workflows to ensure consistency.
Dashboards and Visualizations
Presents information and analytics in a digestible, intuitive, and visually appealing way.
Compliance (4)
Sensitive Data Compliance
Supports compliance with PII, GDPR, HIPPA, PCI, and other regulatory standards.
Training and Guidelines
Provides guidelines or training related to sensitive data compliance requirements,
Policy Enforcement
Allows administrators to set policies for security and data governance
Compliance Monitoring
Monitors data quality and send alerts based on violations or misuse
Data Quality (3)
Data Preparation
Curates collected data for big data analytics solutions to analyze, manipulate, and model
Data Distribution
Facilitates the disseminating of collected big data throughout parallel computing clusters
Data Unification
Compile data from across all systems so that users can view relevant information easily.
Management (17)
Cataloging
Records and organizes all machine learning models that have been deployed across the business.
Monitoring
Tracks the performance and accuracy of machine learning models.
Governing
Provisions users based on authorization to both deploy and iterate upon machine learning models.
Model Registry
Allows users to manage model artifacts and tracks which models are deployed in production.
Data dictionary
Stores the database metadata, that is the definitions of data elements, types, relationships etc.
Data Replication
Creates a copy of the database to maintain consistency and integrity.
Query Language
Allows users to create, update and retrieve data in a database.
Data Modeling
Defines the logical design of the data before building the schemas.
Performance Analysis
Monitors and analyzes critical database attributes like query performance, user sessions, dead lock detail, system errors etc and visualize them on a custom dashboard.
Business Glossary
Lets users build a glossary of business terms, vocabulary and definitions across multiple tools.
Data Discovery
Provides a built-in integrated data catalog that allows users to easily locate data across multiple sources.
Data Profililng
Monitors and cleanses data with the help of business rules and analytical algorithms.
Reporting and Visualization
Visualize data flows and lineage that demonstrates compliance with reports and dashboards through a single console.
Data Lineage
Provides an automated data lineage functionality which provides visibility over the entire data movement journey from data origination to destination.
Cataloging
Records and organizes all machine learning models that have been deployed across the business.
Monitoring
Tracks the performance and accuracy of machine learning models.
Governing
Provisions users based on authorization to both deploy and iterate upon machine learning models.
System (1)
Data Ingestion & Wrangling
Gives user ability to import a variety of data sources for immediate use
Data Management (6)
Data Integration
Consolidates, Cleanses and Normalizes data from multiple disparate sources.
Data Compression
Helps save storage capacity and improves query performance.
Data Quality
Eliminates data inconsistency and duplications ensuring data integrity.
Built-In Data Analytics
SQL based analytics functions like Time series, pattern matching, geospatial analytics etc.
In-Database Machine Learning
Provides built in capabilities like machine learning algorithms, data preparation functions, model evaluation and management etc.
Data Lake Analytics
Allows data querying across data formats like parquet, ORC, JSON etc and analyze complex data types on HDFS
Integration (3)
AI/ ML Integration
Integrates with data science workflows, Machine Learning and artificial intelligence (AI) capabilities.
BI Tool Integration
Integrates with BI Tools to transform data into Actionable Insights.
Data lake Integration
Provides speed in data processing and capturing unstructured, semi-structured and streaming data.
Performance (1)
Scalability
Manages huge volumes of data, upscale or downscale as per demand.
Maintenance (3)
Data Migration
Allows data movement from one database to another.
Backup and Recovery
Provides data backup and recovery functionality to protect and restore a database.
Multi-User Environment
Allows users to access and work on data concurrently, supporting several views of the data.
Security (11)
Data Encryption
Encrypts and transforms data at the database from a readable state into a ciphertext of unreadable characters.
User Access Control
Allows restricted user acess to modify depending on the access level.
Data Governance
Policies, procedures and standards to manage and access data.
Data Security
Restricts data access at a cell level, mask or hide parts of cells, and encrypt data at rest and in motion
Role-Based Authorization
Provides predefined system roles, privileges, and user-defined roles to users.
Authentication
Allows integration with external security mechanisms like Kerberos, LDAP authentication etc.
Audit Logs
Provides an audit log to track access and operations performed on databases for regulatory compliance.
Encryption
Provides encryption capability for all the data at rest using encryption keys.
Access Control
Authenticates and authorizes individuals to access the data they are allowed to see and use.
Roles Management
Helps identify and manage the roles of owners and stewards of data.
Compliance Management
Helps adhere to data privacy regulations and norms.
Storage (2)
Data Model
Stores data tables as columns.
Data Types
Supports multiple data types like lists, sets, hashes (similar to map), sorted sets etc.
Availability (3)
Auto Sharding
Implements auto horizontal data partitioning that allows storing data on more than one node to scale out.
Auto Recovery
Restores a database to a correct (consistent) state in the event of a failure.
Data Replication
Copy data across multiple servers through master-slave, peer-to-peer replication architecture etc.
Performance (1)
Integrated Cache
Stores frequently-used data in system memory quickly.
Support (2)
Multi-Model
Provides support to store, index and query data in more than one format.
Operating Systems
Available on multiple operating systems like Linux, Windows, MacOS etc.
Maintainence (2)
Data Quality Management
Defines, validates, and monitors business rules to safeguard master data readiness.
Policy Management
Allows users to create and review data policies to make them consistent across the organization.
Centralized computation (1)
Centralized Computation
Offers a centralize, neutral location for parties to conduct data analysis.
Localized computation (1)
Localized computation
Offers localized computation, where data remains where it resides and is called by API in order to conduct analysis.
Generative AI (9)
AI Text Generation
Allows users to generate text based on a text prompt.
AI Text Summarization
Condenses long documents or text into a brief summary.
AI Text Generation
Allows users to generate text based on a text prompt.
AI Text Summarization
Condenses long documents or text into a brief summary.
AI Text Generation
Allows users to generate text based on a text prompt.
AI Text Summarization
Condenses long documents or text into a brief summary.
AI Text-to-Image
Provides the ability to generate images from a text prompt.
AI Text Generation
Allows users to generate text based on a text prompt.
AI Text Summarization
Condenses long documents or text into a brief summary.
Agentic AI - Data Governance (6)
Autonomous Task Execution
Capability to perform complex tasks without constant human input
Multi-step Planning
Ability to break down and plan multi-step processes
Cross-system Integration
Works across multiple software systems or databases
Adaptive Learning
Improves performance based on feedback and experience
Natural Language Interaction
Engages in human-like conversation for task delegation
Decision Making
Makes informed choices based on available data and objectives
Agentic AI - Data Science and Machine Learning Platforms (7)
Autonomous Task Execution
Capability to perform complex tasks without constant human input
Multi-step Planning
Ability to break down and plan multi-step processes
Cross-system Integration
Works across multiple software systems or databases
Adaptive Learning
Improves performance based on feedback and experience
Natural Language Interaction
Engages in human-like conversation for task delegation
Proactive Assistance
Anticipates needs and offers suggestions without prompting
Decision Making
Makes informed choices based on available data and objectives





