# DagsHub Reviews
**Vendor:** DagsHub  
**Category:** [Data Science and Machine Learning Platforms](https://www.g2.com/categories/data-science-and-machine-learning-platforms)  
**Average Rating:** 4.8/5.0  
**Total Reviews:** 14
## About DagsHub
DagsHub is a platform that allows you to easily create high-quality datasets for better model performance A single AI platform to curate vision, audio, and document data - automate labeling workflows, and evaluate models. Enterprises with sensitive data, can run on their own infrastructure on-prem and get a full AI platform. Data curation - create the very best datasets. Data annotation - annotate your vision, audio, and document data. Auto labeling - automate your annotation flow with pre-built templates and active learning. Data versioning - version your datasets for reproducibility. Experiment tracking - track your experiment progress, understand trends, and compare results. Model registry - manage your models and deployments in one place. The top data scientists build AI with DagsHub including teams at: Google, Harvard Medicine, Beewise, Macso, and Mana.bio



## DagsHub Pros & Cons
**What users like:**

- Users value the **seamless data management** DagsHub provides, enhancing collaboration and efficiency in model training and experiments. (12 reviews)
- Users value the **seamless integration and versioning of data, experiments, and models** that enhances collaboration and productivity. (12 reviews)
- Users appreciate the **seamless collaboration** DagsHub enables, enhancing productivity and making data management straightforward and reproducible. (11 reviews)
- Users appreciate DagsHub&#39;s **seamless integration** of data, experiments, and models, enhancing collaboration and project management efficiency. (10 reviews)
- Users love the **integrated platform** of DagsHub, which enhances productivity and collaboration while managing complex data pipelines. (10 reviews)
- Integrations (10 reviews)
- Workflow Efficiency (10 reviews)
- Team Collaboration (9 reviews)
- Model Variety (8 reviews)
- Tools Efficiency (8 reviews)

**What users dislike:**

- Users express concern over the **limited functionality** in DagsHub, hindering their ability to extend project capabilities. (2 reviews)
- Users experience **frequent error handling issues** when pushing files to DagsHub, affecting project loading and functionality. (1 reviews)
- Users find DagsHub to be **expensive** , especially noting the limitations of the free plan and access hurdles for academia. (1 reviews)
- Users are frustrated by the **limited customization** options on DagsHub, particularly with the restrictive free plan structure. (1 reviews)
- Users find the **strict limitations of the free plan** restrict their collaboration options significantly. (1 reviews)
- Limited Options (1 reviews)
- Limited Tools (1 reviews)
- Missing Features (1 reviews)
- Poor UI Design (1 reviews)
- Slow Loading (1 reviews)

## DagsHub Reviews
  ### 1. Simplifies LLM Dataset Versioning and Experiment Tracking

**Rating:** 5.0/5.0 stars

**Reviewed by:** Gourav B. | Senior Data Scientist, Enterprise (> 1000 emp.)

**Reviewed Date:** May 08, 2025

**What do you like best about DagsHub?**

DagsHub makes it easy to manage the complex data pipelines required for training and fine-tuning large language models. We use it to version instruction datasets, evaluate prompt variations, and track model performance over time. Having a single source of truth for data, experiments, and models has been a game-changer for reproducibility. The Git-like workflow for data is intuitive, and it integrates smoothly with our existing tools.

**What do you dislike about DagsHub?**

Nothing major. It's working well for our needs. Some visualizations around dataset evolution could make debugging faster, but overall it’s a solid product.

**What problems is DagsHub solving and how is that benefiting you?**

DagsHub is solving the critical problem of managing the complexity of machine learning workflows—especially around data versioning, experiment tracking, and collaboration. For our work with large language models, it simplifies the process of organizing and maintaining instruction datasets, evaluating prompt variations, and tracking model performance across time. By acting as a single source of truth for data, experiments, and models, DagsHub significantly boosts reproducibility and transparency. The Git-like interface for handling data is intuitive and integrates seamlessly with our existing toolchain, which accelerates development and reduces friction across the ML lifecycle. Overall, it's been a foundational tool for scaling our ML operations efficiently.

  ### 2. Reliable Infrastructure for LLM Data and Model Iteration

**Rating:** 5.0/5.0 stars

**Reviewed by:** Ignacio P. | Senior Data Scientist, Mid-Market (51-1000 emp.)

**Reviewed Date:** April 26, 2025

**What do you like best about DagsHub?**

DagsHub lets us keep our LLM training data, experiments, and models tightly connected. We version everything—from raw datasets to tokenizer outputs and model checkpoints. This setup makes it simple to track which data was used, how it was processed, and which experiments led to which results. It’s especially helpful when testing prompt tuning or comparing different model variants. Everything stays reproducible and easy to collaborate on across teams.

**What do you dislike about DagsHub?**

No major issues so far. The platform handles version control and experiment linkage really well. A bit more UI customization would be great, especially for larger projects.

**What problems is DagsHub solving and how is that benefiting you?**

Thanks to DagsHub, we’re spending more time building models and less time wrestling with setup and environment issues. Having code, data, and experiments versioned and tracked together has been a game-changer for traceability and collaboration across teams. What used to be a chaotic mix of tools is now a much more organized and reproducible workflow. It’s streamlined the way we manage ML projects from start to finish.

  ### 3. One-Stop Shop Platform for LLM Data and Experiment Tracking

**Rating:** 4.5/5.0 stars

**Reviewed by:** Verified User in Information Technology and Services | Mid-Market (51-1000 emp.)

**Reviewed Date:** April 28, 2025

**What do you like best about DagsHub?**

DagsHub simplifies versioning large text datasets, tracking fine-tuning experiments, and managing model checkpoints—all within a single platform. In any LLM workflow, connecting a specific dataset version to a model run is essential to ensure reproducibility. DagsHub also makes it easier to manage evaluation metrics across various prompt tuning and fine-tuning experiments, all while maintaining clear data lineage.

**What do you dislike about DagsHub?**

Nothing major — it works very well for my personal LLM projects. While having even more built-in support for prompt datasets would be a nice addition, I find the platform to be already great for full dataset and model versioning.

**What problems is DagsHub solving and how is that benefiting you?**

I have used DagsHub for a few projects, ranging from traditional ML problems like classification to projects involving LLMs. The biggest advantage of the product for me is how easy it is to set up and have almost all experiment tracking, data/model versioning, and lineage available out of the box. This has really simplified my workflows, and I didn’t have to spend much time setting everything up myself.

  ### 4. All the MLOps tools at your fingertips

**Rating:** 5.0/5.0 stars

**Reviewed by:** Pavlo F. | Machine Learning Engineer, Information Technology and Services, Small-Business (50 or fewer emp.)

**Reviewed Date:** January 17, 2025

**What do you like best about DagsHub?**

DagsHub is a best friend of Data Scientists and Machine Learning Engineers since it provides not only a version control repository for the code but also for the data artifacts, such as datasets and models. MLOps tools like DVC and MLflow are available for every repository and hosted on DagsHub out of the box so it's extremely easy to start using them right away! This is such a big advantage because, for example, MLflow tracks machine learning models locally by default so you need to set up an MLflow server when working in a team which isn't obvious and DagsHub is real time saver here. As a cherry on top of the cake, DagsHub offers many GBs of free storage for your data artifacts and you will definitely appreciate it if you want to try it out for your project. Overall, DagsHub is an amazing MLOps platform with many more stuff that will make your life so much easier, such as annotation tools, GitHub integration, Jupyter notebook diffs, etc. The DagsHub documentaion is just great but if you need extra help, the DagsHub team is super responsive on their Discord channel. Feel free to check out my DagsHub project where I describe in detail how I used its features for my model cloud deployment pipeline https://dagshub.com/PavloFesenko/gif_analyzer

**What do you dislike about DagsHub?**

No complaints, I only wish that I have discovered DagsHub earlier. 😄

**What problems is DagsHub solving and how is that benefiting you?**

DagsHub helped me to automate my model deployment pipeline thanks to hosting MLflow and providing GitHub integrations. This allows seamless update of the cloud production systems when a new machine learning model is created by data scientists so that manual actions aren't required anymore. Moreover, all these changes are tracked in MLflow so it's transparent for everyone, and rollback to previous versions can be done automatically if needed. For more details, please check my DagsHub project https://dagshub.com/PavloFesenko/gif_analyzer

  ### 5. Streamlines Data Workflows with Reproducible Experiment Tracking

**Rating:** 4.5/5.0 stars

**Reviewed by:** Manuel M. | Community Organizer, Mid-Market (51-1000 emp.)

**Reviewed Date:** April 21, 2025

**What do you like best about DagsHub?**

DagsHub helps us organize unstructured data—images, text, and more—into well-managed datasets. What stands out is how tightly it connects our data with experiment runs and trained model versions. This makes it easy to reproduce results, compare model performance, and trace issues back to specific data changes. It’s a solid platform for teams that care about reproducibility, collaboration, and data quality in ML projects.

**What do you dislike about DagsHub?**

Nothing major to note—overall, the experience has been smooth. Minor UI improvements could make large-scale project navigation even better.

**What problems is DagsHub solving and how is that benefiting you?**

DagsHub brings order to the chaos of ML projects. Instead of juggling tools for code, data, and experiments, everything is versioned and tracked in one place. It’s made our workflows more reproducible and collaborative, especially when working across teams. We’ve cut down on setup time, improved traceability, and now spend more time actually building models — not debugging environments.

  ### 6. Best Platform for Managing LLM Training Data and Experiments

**Rating:** 4.5/5.0 stars

**Reviewed by:** Nilesh B. | Founder , Small-Business (50 or fewer emp.)

**Reviewed Date:** May 08, 2025

**What do you like best about DagsHub?**

DagsHub is ideal for managing large language model (LLM) training data. We use it to version curated text corpora, track data cleaning steps, and run experiments on fine-tuned models—all with full reproducibility. The ability to connect datasets directly to experiment runs and model outputs helps us stay organized and iterate quickly. It’s also helpful for collaborating across teams working on data prep, prompt engineering, and evaluation.

**What do you dislike about DagsHub?**

No major issues. The platform handles LLM workflows well. A few more features around prompt versioning would be useful, but what’s there is already saving us a ton of time.

**What problems is DagsHub solving and how is that benefiting you?**

No problem as such.

  ### 7. Its my go to platform for experiments, data, and AI models.

**Rating:** 5.0/5.0 stars

**Reviewed by:** Ori C. | GenAI Freelancer &amp; Consultant , Small-Business (50 or fewer emp.)

**Reviewed Date:** March 03, 2025

**What do you like best about DagsHub?**

DagsHub allows me to easily manage unstructured data in the context of complex models and versioning. It allows me to save a fixed state of code and data for every experiment and saves me time dealing with various tools. I have used it in several projects so far, it makes project management a lot easier. saves me time with tool integration because the internal offering is easy to implement.

**What do you dislike about DagsHub?**

No major complaints, the platform has been reliable and intuitive so far.

**What problems is DagsHub solving and how is that benefiting you?**

Classification, detection, basically anything that lives in the unsupervised or supervised reals.

  ### 8. DagsHub - comprehensive platform for data versioning and management, and for experiment tracking

**Rating:** 5.0/5.0 stars

**Reviewed by:** Jakub N. | assistant professor, Higher Education, Mid-Market (51-1000 emp.)

**Reviewed Date:** January 13, 2025

**What do you like best about DagsHub?**

DagsHub provides seamless integration with the data version control tool of my choice, namely DVC; it can be easily used as remote repository for storing large data files, and for storing directories with large amount of files. I also like its integration with Git repository hosting sites, not only GitHub, but also other such services, like GitLab or Bitbucket.

DagsHub repository makes it possible to browse and analyze data files, regardless of whether they are versioned using Git, or using DVC. The visualization of data processing pipeline includes both stages, and outputs / data dependencies.

I have only lightly tried the experiment tracking part of DagsHub, but I like what I have seen so far. DagsHub includes support for both DVC experiments (`dvc exp`) and MLflow experiments tracking.

I have yet to try the data streaming support, or mounting DagsHub storage as S3 filesystem - but it looks like a neat feature.

**What do you dislike about DagsHub?**

I haven't notice any major issues so far. The platform is robust, and caters well to our data tracking needs.

I don't like the very strict limitation of the free plan (maximum of 2 people in a team), but I can understand it. DagsHub does offer full version for academia, but it is at request, and it is not automated (using for example using Shibboleth login, like GitLab does it).

**What problems is DagsHub solving and how is that benefiting you?**

For us, DagsHub solves a few problems. First is helping to avoiding unnecessary re-computations; thanks to DagsHub working as a data version remote, one person can make use of the results computed by another person, even if they use different machines. Second, it allows to examine data via a web browser interface, without the need to access the machine the data was computed on. Third, it helps with dissemination of data together with code (and with experiments and their metrics and associated plots).

  ### 9. End-to-end control of ML data and experiments

**Rating:** 4.5/5.0 stars

**Reviewed by:** Verified User in Computer Software | Small-Business (50 or fewer emp.)

**Reviewed Date:** May 04, 2025

**What do you like best about DagsHub?**

DagsHub gives me full control over my machine learning data and experiments. I can version raw datasets, preprocess them, track experiments, and manage model outputs, all in one place. This tight integration means less time syncing tools and more time improving my models. It’s especially useful for unstructured data like images and documents, where traceability and collaboration are hard to maintain without the right setup.

**What do you dislike about DagsHub?**

Nothing critical so far. The platform handles most of my workflows out of the box. Would be great to see more integrations with external training environments, but the current ones cover most use cases.

**What problems is DagsHub solving and how is that benefiting you?**

It takes care of image data versioning and preprocessing, and experiment tracking.

  ### 10. End-to-End Data and Experiment Management for ML Teams

**Rating:** 4.5/5.0 stars

**Reviewed by:** Liron A. | Freelancer, Small-Business (50 or fewer emp.)

**Reviewed Date:** March 20, 2025

**What do you like best about DagsHub?**

DagsHub provides an out of the box platform to manage data, code and experiments in one place

**What do you dislike about DagsHub?**

Didn't find an easy option to extend the abilities, for example, inheriting part of a model from a different project.

**What problems is DagsHub solving and how is that benefiting you?**

Managing different aspect of the project in one place, so I recreate what works and investigate why

  ### 11. Reliable Data Management and Experiment Tracking for ML Workflows

**Rating:** 4.5/5.0 stars

**Reviewed by:** Michael L. | CTO, Small-Business (50 or fewer emp.)

**Reviewed Date:** March 17, 2025

**What do you like best about DagsHub?**

DagsHub makes it simple to organize and version unstructured data while seamlessly connecting it to experiments and model training. The built-in experiment tracking ensures every model run is reproducible, making it easy to compare results and iterate efficiently. Having data, experiments, and models integrated into a single platform removes the hassle of juggling multiple tools, improving both productivity and collaboration.

**What do you dislike about DagsHub?**

No significant drawbacks so far—the platform has been smooth and effective for our ML workflows.

**What problems is DagsHub solving and how is that benefiting you?**

Version management of datasets, models, and experiments - all in one place.

  ### 12. Ideal for multimodal workflows

**Rating:** 4.5/5.0 stars

**Reviewed by:** Abid Ali  A. | Data Scientist, Mid-Market (51-1000 emp.)

**Reviewed Date:** January 13, 2025

**What do you like best about DagsHub?**

DagsHub is super helpful for handling multimodal data like vision, audio, and text. It makes cleaning and organizing unstructured data really easy. The built-in experiment tracking and model management tools help us stay on top of everything. The best part? It’s simple enough for anyone on the team to use.

**What do you dislike about DagsHub?**

Honestly, nothing so far—it does exactly what we need.

**What problems is DagsHub solving and how is that benefiting you?**

It provides all the nessary machine learning tools at one place.

  ### 13. I love DagsHub...

**Rating:** 5.0/5.0 stars

**Reviewed by:** Ramaiah C. | Sr. AI/ML Consultant, Small-Business (50 or fewer emp.)

**Reviewed Date:** January 31, 2025

**What do you like best about DagsHub?**

DagsHub simplifies working with multimodal data by streamlining data transformation, experiment tracking, and model management. Its automation tools enhance labeling efficiency, accelerating workflows. With an intuitive interface, it ensures seamless collaboration across teams.

**What do you dislike about DagsHub?**

I haven’t encountered any problems, it’s been a smooth and enjoyable experience.

**What problems is DagsHub solving and how is that benefiting you?**

DagsHub simplifies working with multimodal data by streamlining data transformation, experiment tracking, and model management.

  ### 14. DAGsHub is a GitHub supplement for Data Scientists and ML Engineers.

**Rating:** 5.0/5.0 stars

**Reviewed by:** mounika s. | Data Annotator, Mid-Market (51-1000 emp.)

**Reviewed Date:** November 14, 2023

**What do you like best about DagsHub?**

GAGsHub is where people build data science projects Cover the entire machine learning life cycle, no no DevOps required.We can track experiments.We can label the data and visualize, compare, andnshare our results.With a community of thousands of machine learning professionals,DAGsHub helps large international teams and individuals build projects that advancce audio.We can communicate effectively by having interactive discussions on any experiment or filentake notes on the best model architecture or review a ream members's contribution build a knowledge base for your future self and your team.Close the loop from data to production, faster than ever that's the magic of DAGsHub and start building now.

**What do you dislike about DagsHub?**

Users might get an error when trying to push files to DAGsHub while pulling files might work.When trying to load a Label Studio project from DAGsHub Annotations. it fails with the Runtime error.

**What problems is DagsHub solving and how is that benefiting you?**

DAGshub makes the maling better besiness decisions and measuring performance and can develop better products and easily we can incerease efficency mitigating risk and fraud and can incerease customer experinces.



- [View DagsHub pricing details and edition comparison](https://www.g2.com/products/dagshub/reviews?section=pricing&secure%5Bexpires_at%5D=2026-05-14+05%3A06%3A45+-0500&secure%5Bsession_id%5D=f621754d-a67b-425d-88a3-f6a4ab5124e1&secure%5Btoken%5D=2b9d2428281cf4eef9e6535b5b433a2a6aaacff75ff3975538ca9524fd81fb98&format=llm_user)

## DagsHub Features
**Deployment**
- Language Flexibility
- Framework Flexibility
- Versioning
- Ease of Deployment
- Scalability

**System**
- Data Ingestion & Wrangling

**Quality**
- Labeler Quality
- Task Quality
- Data Quality
- Human-in-the-Loop

**Deployment**
- Language Flexibility
- Framework Flexibility
- Versioning
- Ease of Deployment
- Scalability

**Recognition Type**
- Emotion Detection
- Object Detection
- Text Detection
- Motion Analysis
- Scene Reconstruction
- Logo Detection
- Explicit Content Detection
- Video Detection

**Model Training & Optimization - Active Learning Tools**
- Model Training Efficiency
- Automated Model Retraining
- Active Learning Process Implementation
- Iterative Training Loop Creation
- Edge Case Discovery

**Integration - Machine Learning**
- Integration

**Model Development**
- Language Support
- Drag and Drop
- Pre-Built Algorithms
- Model Training

**Management**
- Cataloging
- Monitoring
- Governing
- Model Registry

**Model Development**
- Feature Engineering

**Automation**
- Machine Learning Pre-Labeling
- Automatic Routing of Labeling

**Operations**
- Metrics
- Infrastructure management
- Collaboration

**Facial Recognition**
- Facial Analysis
- Face Comparison

**Data Management & Annotation - Active Learning Tools**
- Smart Data Triage
- Data Labeling Workflow Enhancement
- Error and Outlier Identification
- Data Selection Optimization
- Actionable Insights for Data Quality

**Learning - Machine Learning**
- Training Data
- Actionable Insights
- Algorithm

**Machine/Deep Learning Services**
- Computer Vision
- Natural Language Processing
- Natural Language Generation
- Artificial Neural Networks

**Machine/Deep Learning Services**
- Natural Language Understanding
- Deep Learning

**Image Annotation**
- Image Segmentation

- Object Detection
- Object Tracking
- Data Types

**Management**
- Cataloging
- Monitoring
- Governing

**Labeling**
- Model Training
- Bounding Boxes
- Custom Image Detection

**Model Performance & Analysis - Active Learning Tools**
- Model Performance Insights
- Cost-Effective Model Improvement
- Edge Case Integration
- Fine-tuning Model Accuracy
- Label Outlier Analysis

**Deployment**
- Managed Service
- Application
- Scalability

**Natural Language Annotation**
- Named Entity Recognition
- Sentiment Detection
- OCR

**Deployment**
- Integrations

**Generative AI**
- AI Text Generation
- AI Text Summarization

**Speech Annotation**
- Transcription
- Emotion Recognition

**Generative AI**
- AI Text Generation
- AI Text Summarization
- AI Text-to-Image

**Agentic AI - Data Science and Machine Learning Platforms**
- Autonomous Task Execution
- Multi-step Planning
- Cross-system Integration
- Adaptive Learning
- Natural Language Interaction
- Proactive Assistance
- Decision Making

## Top DagsHub Alternatives
  - [Databricks](https://www.g2.com/products/databricks/reviews) - 4.6/5.0 (740 reviews)
  - [SAS Viya](https://www.g2.com/products/sas-sas-viya/reviews) - 4.3/5.0 (754 reviews)
  - [MATLAB](https://www.g2.com/products/matlab/reviews) - 4.5/5.0 (748 reviews)

