Groq, Inc. is an American artificial intelligence company that builds an AI accelerator application-specific integrated circuit, which it calls the Language Processing Unit (LPU), along with related hardware to accelerate the inference performance of AI workloads. Groq claims that models running on its hardware reach up to 10x the speed of traditional GPU-based inference.
Production models are ideal for use in production environments (i.e., your app), as they will not be discontinued without notice.
Preview models are the latest models, available for testing; feel free to use them, but keep in mind they may change or be discontinued.
Compound systems combine models with agentic tools such as code execution and web search.
While LLMs excel at generating text, compound-beta takes the next step. It is an advanced AI system designed to solve problems by taking action, intelligently using external tools (starting with web search and code execution) alongside the powerful Llama 4 models and the Llama 3.3 70B model. This gives it access to real-time information and the ability to interact with external environments, producing more accurate, up-to-date, and capable responses than an LLM alone.
There are two agentic tool systems available:
compound-beta: supports multiple tool calls per request. This system is great for use cases that require multiple web searches or code executions per request.
compound-beta-mini: supports a single tool call per request. This system is great for use cases that require a single web search or code execution per request. compound-beta-mini has an average of 3x lower latency than compound-beta.
Both systems support the following tools:
Web Search via Tavily
Code Execution via E2B (only Python is currently supported)
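As a minimal sketch of calling one of these systems, the snippet below assumes Groq's OpenAI-compatible chat completions endpoint (`https://api.groq.com/openai/v1/chat/completions`) and a `GROQ_API_KEY` environment variable; the helper names are my own. You send an ordinary chat request with the model set to `compound-beta` (or `compound-beta-mini`), and the system decides on its own whether to run a web search or execute code before answering:

```python
import os
import requests

# Assumed OpenAI-compatible endpoint; check Groq's docs for the current URL.
API_URL = "https://api.groq.com/openai/v1/chat/completions"

def build_payload(prompt, model="compound-beta"):
    """Build a chat-completion request body.

    Swap model for "compound-beta-mini" when one tool call per
    request is enough and you want lower latency.
    """
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }

def ask(prompt):
    """Send the request; tool use (search / code execution) happens server-side."""
    headers = {"Authorization": f"Bearer {os.environ['GROQ_API_KEY']}"}
    resp = requests.post(API_URL, json=build_payload(prompt), headers=headers, timeout=60)
    resp.raise_for_status()
    return resp.json()["choices"][0]["message"]["content"]
```

Note that there is no tool-definition boilerplate on the client side: unlike regular function calling, the tools belong to the system itself.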
Groq offers some of the most affordable API rates around. It also has a pretty generous free plan that lets you use all available models with a daily allotment of free tokens, and trust me when I say it's a lot.
Rate limits are measured in:
RPM: Requests per minute
RPD: Requests per day
TPM: Tokens per minute
TPD: Tokens per day
ASH: Audio seconds per hour
ASD: Audio seconds per day
Rate limits apply at the organization level, not to individual users. You can hit any limit type depending on which threshold you reach first.
Example: Let's say your RPM = 50 and your TPM = 200K. If you sent 50 requests of only 100 tokens each within a minute, you would hit your limit even though those 50 requests used nowhere near 200K tokens.
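The "whichever threshold you reach first" rule can be sketched as a few lines of Python; the function name and the idea of checking counts client-side are mine, not part of Groq's API (the real limits are enforced server-side and reported via response headers):

```python
def first_limit_hit(requests_sent, tokens_sent, rpm=50, tpm=200_000):
    """Return which per-minute limit trips first, or None if neither.

    Defaults mirror the example above: RPM = 50, TPM = 200K.
    """
    if requests_sent >= rpm:
        return "RPM"
    if tokens_sent >= tpm:
        return "TPM"
    return None

# 50 requests of 100 tokens each is only 5,000 tokens,
# but the request count alone trips the RPM limit.
print(first_limit_hit(50, 50 * 100))  # → RPM
```

The same logic extends to the daily (RPD/TPD) and audio (ASH/ASD) counters: each is an independent bucket, and the first one to fill blocks the request.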
This is my usage graph. I have built five apps, each averaging around 500 users, and you can see how affordable Groq is. I'd bet anything else, whether ChatGPT or Gemini, costs more than Groq.
To integrate Groq's text-based models into your application, try the GroqText extension, which supports all of these models, including the vision models.
Learn more about GroqText here: MIT AI2 Community Post, Kodular Community Post, Niotron Community Post
To integrate PlayAI TTS and generate realistic speech in your application, try the PlayAI extension, which supports PlayAI TTS (Groq).
Learn more about PlayAI extension here: MIT AI2 Community Post, Kodular Community Post, Niotron Community