Question 1

Is it really free? What is the catch?

Accepted Answer

No catch. We pay for the inference. We use Ripple AI ourselves every day for drafting, debugging, summarizing, and research, and we left the door open so anyone can use it. There is a per-IP rate limit and per-vendor daily caps so a single user cannot burn the whole budget, but normal use never hits them.

Question 2

What models do you offer?

Accepted Answer

Llama 3.3 70B and Qwen 3 235B are the workhorses. We add and retire models as the open-source frontier shifts, so the picker is the source of truth. All of them route through Cloudflare Workers AI, Groq, or Cerebras depending on availability and latency.

Question 3

What is Auto routing?

Accepted Answer

We measure time-to-first-token across vendors in real time. Fast biases toward the snappiest vendor for short tasks. Smart biases toward the heaviest reasoning model. Balanced splits the difference. If a vendor goes down or gets slow, the router falls back to the next healthy one without you noticing.

Question 4

What does Search mode do?

Accepted Answer

Turn on the Search toggle in the composer and the system runs a research fan-out: a planner LLM breaks your question into search queries, runs them in parallel against the web, and a synthesizer model writes the answer with inline citations to the sources it used. Good for anything that depends on current information.

Question 5

Do you save my chats?

Accepted Answer

No. Public chats run through a stateless session. Nothing is written to a database. When you close the tab, the conversation is gone. We see aggregate token counts and latency per vendor for health monitoring, never the content of your messages.

Question 6

Is it as good as ChatGPT or Claude?

Accepted Answer

For most everyday tasks, you will not notice a difference. The frontier closed-source models (GPT and Claude) still have an edge on the hardest reasoning, the longest context, and the most agentic workflows. For drafting, summarizing, code questions, plain-English explainers, and quick research with citations, the open-source models we route to are entirely competent.

Question 7

Why did Ripple build an AI chat tool?

Accepted Answer

Two reasons. One, we wanted a place to run open-source models on infrastructure we control, without a third-party SaaS reading every prompt our team writes. Two, we ship products for a living, and shipping our own products in public is how we prove that. Ripple AI sits next to Ripple Meet and GST Invoice Generator on /tools as evidence we build, not just consult.

Question 8

Are there usage limits?

Accepted Answer

A per-IP rate limit on chat requests to keep abuse out, and a per-vendor daily token budget so one bad day with a runaway script does not burn the whole month. Normal interactive use never trips either. If you hit a limit you will get a clear message, not a silent failure.

Question 9

What is coming next?

Accepted Answer

On the build queue: file uploads for context, longer running research with deeper fetches, image inputs, and a saved-conversation mode for signed-in admins. The privacy default for the public chat (nothing saved) stays.

We built our own AI chat.
You’re invited.

Open a URL. Start typing.

Auto routing across three vendors.

Search the web, with citations.

Top open-source models, no API key required.

Why does a design studio build an AI chat tool?

Today, and what we’re building next

Use it today for

Building next

Ready when you are.

How it works

Click Start chatting

Pick a model, or leave it on Auto

Type your question, hit send

Questions

We built our own AI chat.You’re invited.