Mistral-7B-v0.1 is a small yet powerful model adaptable to many use cases. Mistral 7B outperforms Llama 2 13B on all benchmarks, has natural coding abilities, and supports an 8k sequence length. It is released under the Apache 2.0 license.
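As a reference point for evaluators, here is a minimal sketch of running Mistral-7B-v0.1 through the Hugging Face transformers library; the model ID is the public checkpoint, while the dtype and device settings are assumptions for a single-GPU setup.

```python
# Minimal sketch: generate text with Mistral-7B-v0.1 via transformers.
# Assumes a GPU with enough memory for half-precision 7B weights.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mistral-7B-v0.1"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # half precision to reduce memory use
    device_map="auto",          # place layers on available devices
)

# A base model completes text rather than following chat instructions.
inputs = tokenizer("def fibonacci(n):", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```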
BLOOM-560m is a transformer-based language model developed by BigScience, designed to facilitate research in large language models (LLMs). It serves as a pre-trained base model capable of generating human-like text in multiple languages.
Granite-3.1-3B-A800M-Base is a state-of-the-art language model developed by IBM, designed to handle complex natural language processing tasks with high efficiency. This model employs a sparse Mixture of Experts (MoE) architecture, activating roughly 800 million of its 3 billion parameters per token, as the "A800M" in its name indicates.
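Several of the Granite models listed here use the same sparse Mixture of Experts design, so a toy sketch may help clarify it: a router selects the top-k experts for each token, so only a small "active" fraction of the total parameters runs on each forward pass. The sizes and routing scheme below are illustrative assumptions, not IBM's actual configuration.

```python
# Toy sketch of sparse Mixture-of-Experts routing (illustrative sizes).
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseMoE(nn.Module):
    def __init__(self, d_model=256, n_experts=8, top_k=2):
        super().__init__()
        self.top_k = top_k
        self.router = nn.Linear(d_model, n_experts)  # scores each expert per token
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(),
                          nn.Linear(4 * d_model, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x):                              # x: (tokens, d_model)
        scores = self.router(x)                        # (tokens, n_experts)
        weights, idx = scores.topk(self.top_k, dim=-1) # keep top-k experts per token
        weights = F.softmax(weights, dim=-1)
        out = torch.zeros_like(x)
        for slot in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, slot] == e               # tokens routed to expert e
                if mask.any():
                    out[mask] += weights[mask, slot, None] * expert(x[mask])
        return out

moe = SparseMoE()
print(moe(torch.randn(10, 256)).shape)  # torch.Size([10, 256])
```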
Microsoft Azure's Phi 3 model redefines large-scale language model capabilities in the cloud.
BLOOM-1b1 is a multilingual language model developed by the BigScience Workshop, designed to generate human-like text across 48 languages. As a transformer-based model, it utilizes a decoder-only architecture.
BLOOM-1b7 is a transformer-based language model developed by the BigScience Workshop, designed to generate human-like text across 48 languages. As a scaled-down variant of the larger BLOOM model, it offers similar multilingual capabilities at a lower computational cost.
BLOOM-3B is a 3-billion-parameter multilingual language model developed by the BigScience initiative. As a scaled-down version of the larger BLOOM model, it maintains the same architecture and training data as its larger counterpart.
BLOOM-7B1 is a multilingual language model developed by BigScience, designed to generate human-like text across 48 languages. With over 7 billion parameters, it leverages a transformer-based architecture.
Granite-3.1-1B-A400M-Base is a language model developed by IBM's Granite Team, designed to handle extensive context lengths up to 128K tokens. This model is based on a decoder-only sparse Mixture of Experts (MoE) architecture, with roughly 400 million active parameters per token.
Granite-3.2-2B-Instruct is a 2-billion-parameter language model developed by IBM's Granite Team, designed to handle a wide range of instruction-following tasks. Built upon its predecessor, Granite-3.1-2B-Instruct, it adds enhanced reasoning capabilities.
Granite-3.2-8B-Instruct is an 8-billion-parameter AI model fine-tuned for advanced reasoning tasks. Built upon its predecessor, Granite-3.1-8B-Instruct, it has been trained using a combination of permissively licensed open-source data and synthetic data.
Granite-3.3-2B-Instruct is a 2-billion-parameter language model developed by IBM's Granite Team, designed to enhance reasoning and instruction-following capabilities. With a context length of 128K tokens, it can process long documents and extended multi-turn conversations.
Granite-3.3-8B-Instruct is an advanced language model developed by IBM's Granite Team, featuring 8 billion parameters and a 128K context length, and fine-tuned for enhanced reasoning and instruction-following.
Granite-4.0-Tiny-Preview is a 7-billion-parameter fine-grained hybrid mixture-of-experts (MoE) instruction-following model developed by IBM's Granite Team, fine-tuned from the Granite-4.0-Tiny-Base-Preview base model described below.
Granite-4.0-Tiny-Base-Preview is a 7-billion-parameter hybrid mixture-of-experts (MoE) language model developed by IBM's Granite Team. It features a 128,000-token context window and utilizes the Mamba-2 architecture in combination with transformer layers.