Best Image Recognition Software

Image recognition, also known as computer vision, allows applications using specific deep learning algorithms to understand images or videos. In these scenarios, images are data in the sense that they are inputted into an algorithm, the algorithm performs a requested task, and the algorithm outputs a solution provided by the image. One common execution for computer vision applications includes facial recognition—whether for tagging friends on Facebook or a police department identifying a potential suspect—solely based on an image. Another use for image recognition is in the medical field, where artificial intelligence, using image recognition, can observe an x-ray and decipher the diagnosis solely based on the image. Some other aspects of image recognition include image restoration, object recognition, and scene reconstruction. These capabilities may be embedded inside intelligent applications or offered as deep learning algorithms inAI platforms.

To qualify for inclusion in the Image Recognition category, a product must:

  • Provide a deep learning algorithm specifically for image recognition
  • Connect with image data pools to learn a specific solution or function
  • Consume the image data as an input and provide an outputted solution
G2 Grid® for Image Recognition
Leaders
High Performers
Contenders
Niche
Market Presence
Satisfaction
content
Star Rating

Image Recognition reviews by real, verified users. Find unbiased ratings on user satisfaction, features, and price based on the most reviews available anywhere.

Compare Image Recognition Software

Results: 75
G2 takes pride in showing unbiased ratings on user satisfaction. G2 does not allow for paid placement in any of our ratings.
Results: 75
Filter Results
Filter by:
Sort by
Star Rating
Sort By:

    Microsoft Computer Vision API is a cloud-based API tool that provides developers with access to advanced algorithms for processing images and returning informatio, by uploading an image or specifying an image URL, it analyze visual content in different ways based on inputs and user choices.

    Amazon Rekognition makes it easy to add image and video analysis to your applications. It can identify the objects, people, text, scenes, and activities, or any inappropriate content from an image or video.

    IBM Watson Visual Recognition is a tool that allow users to automatically identify subjects and objects contained within the image and organize and classify these images into logical categories.

    Azure Custom Vision Service is a tool for building custom image classifiers, and for making them better over time. This service enables you to identify your own objects and things in images.

    Microsoft Video API is a cloud-based API that provides advanced algorithms for tracking faces, detecting motion, stabilizing and creating thumbnails from video, it allows user to build more personalized and intelligent apps by understanding and automatically transforming video content.

    Derive insight from images with our powerful Cloud Vision API

    OpenCV is a tool that has has C++, C, Python and Java interfaces and supports Windows, Linux, Mac OS, iOS and Android for computational efficiency and with a strong focus on real-time applications, written in optimized C/C++, the library can take advantage of multi-core processing and enabled to take advantage of the hardware acceleration of the underlying heterogeneous compute platform

    Azure Face API uses state-of-the-art cloud-based face algorithms to detect and recognize human faces in images.Its capabilities include features like face detection, face verification, and face grouping to organize faces into groups based on their visual similarity.

    SimpleCV is an open source framework for building computer vision applications, user can get access to several high-powered computer vision libraries such as OpenCV without having to learn about bit depths, file formats, color spaces, buffer management, eigenvalues, or matrix versus bitmap storage.

    Clarifai
    (11)4.6 out of 5
    Optimized for quick response
    Optimized for quick response

    Clarifai offers a suite of tools that make it easy for anyone to quickly and accurately train, customize, and use machine learning-powered image and video recognition in their products.

    Microsoft Emotion API is a tool that analyze faces to detect a range of feelings and personalize your app's responses.

    scikit-image is a collection of algorithms for image processing.

    And this is where Google's deep dream ideas originate. With simple words you give to an AI program a couple of images and let it know what those images contain ( what objects - dogs, cats, mountains, bicycles, ... ) and give it a random image and ask it what objects it can find in this image.

    DeepPy is a MIT licensed deep learning framework that tries to add a touch of zen to deep learning as it allows for Pythonic programming based on NumPy's ndarray,has a small and easily extensible codebase, runs on CPU or Nvidia GPUs and implements the following network architectures feedforward networks, convnets, siamese networks and autoencoders.

    Founded in 2013, Hive is a full-stack deep learning company focused on solving visual intelligence challenges for enterprises and governments through three main pillars of the business: Hive Data, Hive Predict, and Hive Media. Hive Data is the world's largest two-sided marketplace for machine learning data labeling. We have over 800,000 distributed workers helping research labs and enterprise clients generate high quality truth data sets in fields like autonomous driving and robotics. Hive Predict is our set of proprietary visual intelligence models that solve problems like the identification of celebrities and logos. We sell these APIs to companies looking to incorporate our models into their own workflows. Our flagship product is Hive Media, through which we are selling the world's largest television analytics dataset.

    MobileEngine makes it easy for you to add image recognition to your app. You provide a reference database of images (e.g. artwork, consumer packaged goods, book covers, catalog pages, etc.) and when your users photograph that object, MobileEngine finds your matching reference image.

    PCV is an open source Python module for computer vision

    Vize.ai AI is custom image recognition and classification API, designed to allow developers and businesses to analyze image data.

    Alibaba Cloud Image Search is an intelligent image search service that helps users find similar or identical images. Based on machine learning and deep learning, the product enables end-users to take a screenshot or upload an image to search and find desired products and fulfill other search requests

    Caffe is a deep learning framework made with expression, speed, and modularity in mind.

    IBM Intelligent Video Analytics helps security and public safety organizations develop comprehensive security, intelligence and investigative capabilities using video.

    VIGRA is a computer vision library that puts its main emphasis on flexible algorithms.

    Azure Video Indexer enables customers with digital video and audio content to automatically extract metadata and use it to build intelligent innovative applications.

    Gesture Recognition Toolkit (GRT) is a cross-platform, open-source, C++ machine learning library designed for real-time gesture recognition.

    GraphicsMagick is a collection of tools and libraries to read, write, and manipulate an image in any of the more popular image formats.

    OpenFace is a Python and Torch implementation of face recognition with deep neural networks and is based on the CVPR 2015

    Allows you to implement image recognition technology within your web or mobile applications.

    Torch is a scientific computing framework with wide support for machine learning algorithms that puts GPUs first.

    VLFeat is an open source library that implements popular computer vision algorithms specializing in image understanding and local features extraction and matching, it include Fisher Vector, VLAD, SIFT, MSER, k-means, hierarchical k-means, agglomerative information bottleneck, SLIC superpixels, quick shift superpixels, large scale SVM training, and many others. It is written in C for efficiency and compatibility, with interfaces in MATLAB for ease of use, and detailed documentation throughout. It supports Windows, Mac OS X, and Linux.

    WineEngine is powered by TinEye's unparalleled image recognition technology and has been engineered and optimized to work with photographs captured by users' smart devices.

    3VR is a video technology and data company that solves the challenges associated with video searchability, allowing customers to rapidly gather real-time intelligence from the unstructured video data that is produced by a single camera or a global network of cameras.

    ACTi video analytics are designed to help you transform your video surveillance network into a smart detection system and a valuable resource for business management.

    AForge.Vision is a vision library that contains some vision classes - set of motion detection algorithms.

    Anyline is a SDK that allows to build real-time mobile OCR apps with highest recognition rates, not requiring any server infrastructure.

    Aurora Image Item Processing is a suite of check processing designed to provide superior performance and accuracy for your check processing operations.

    Blitline provides a platform for processing (cropping, rotating, compositing, filtering) images in a massively parallel environment.

    CraftAR by Catchoom is an image recognition and augmented reality platform for mobile and web applications.

    CCV is a open source/cross-platform solution for blob tracking with computer vision. that can interface with various web cameras and video devices as well as connect to various TUIO/OSC/XML enabled applications and supports many multi-touch lighting techniques including: FTIR, DI, DSI, and LLP with expansion planned for the future vision applications (custom modules/filters).

    CEREBRO applies cutting-edge technology, to gather real-time data from various data sources, including high-end sensors with demographic recognition capabilities to make sure you will reach your right audience and spend your budgets wisely.

    Cloudsight is an image recognition API providing true understanding for your digital media.

    DeepAI allows users to learn about data science and play with the latest research.

    DeepLearningKit is an open source Deep Learning Framework for Apple's iOS, OS X & tvOS. Developed in Swift & Metal (GPU acceleration).

    Density offers a people counter & API.

    Image Processing is a tool that deals with array2d objects that contain various kinds of pixels or user defined generic image objects.

    Eblearn is an object-oriented C++ library that implements various machine learning models, including energy-based learning, gradient-based learning for machine composed of multiple heterogeneous modules.

    ENVI SARscape allows you to easily process and analyze SAR data acquired from all existing spaceborne and selected airborne platforms. Generate products, and integrate information with other geospatial products.

    Face++ provides APIs and SDKs for facial recognition.

    FaceX provides a face detection and face recognition web service that can be integrated to your apps with just a few lines of code.

    Imagga is an image recognition platform-as-a-service that provides image tagging APIs for developers and businesses to build scalable, image and video intensive apps.