Token (AI)
A token is the smallest unit of text that an AI language model processes, such as a word, part of a word, or punctuation mark.
What is a token?
A token is the basic unit in which AI language models process text: a complete word, part of a word, a number, or a punctuation mark. The Dutch word "volgsysteem" (tracking system), for example, is split into multiple tokens. The number of tokens determines how much text a model can process at once (the context window) and directly affects the processing speed and cost of AI applications.
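How a compound like "volgsysteem" ends up as multiple tokens can be sketched with a toy greedy longest-match tokenizer. The two-entry vocabulary below is invented for illustration; real models use learned BPE or WordPiece vocabularies with tens of thousands of entries.

```python
def tokenize(word, vocab):
    """Greedily split a word into the longest subwords found in vocab,
    falling back to single characters for unknown spans."""
    tokens = []
    i = 0
    while i < len(word):
        for j in range(len(word), i, -1):
            piece = word[i:j]
            if piece in vocab or j == i + 1:  # single-char fallback
                tokens.append(piece)
                i = j
                break
    return tokens

# Toy vocabulary for illustration only.
print(tokenize("volgsysteem", {"volg", "systeem"}))  # ['volg', 'systeem']
```

A real tokenizer works on whole sentences and learns its vocabulary from data, but the principle is the same: frequent pieces become single tokens, rare words are split.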
How do tokens work?
When text is sent to a language model, it is first split into tokens by a tokenizer. Each token is mapped to a numerical ID that the model uses to process the text statistically. Both the input (the prompt) and the output (the answer) consume tokens. An efficient prompt not only produces better answers but also consumes fewer tokens. Wabber optimizes token processing within the RAG pipeline, ensuring that only the most relevant context is provided to the model.
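The token-to-ID step can be sketched with a toy encoder that assigns IDs on the fly; a real tokenizer uses the model's fixed, learned vocabulary, so the IDs below are purely illustrative.

```python
# Toy encoder: gives each distinct token a numeric ID, mimicking what a
# tokenizer does. A real model uses the vocabulary it was trained with.
def encode(tokens, vocab):
    return [vocab.setdefault(tok, len(vocab)) for tok in tokens]

vocab = {}
prompt_ids = encode(["How", "do", "I", "process", "a", "return", "shipment", "?"], vocab)
print(prompt_ids)       # [0, 1, 2, 3, 4, 5, 6, 7]
print(len(prompt_ids))  # 8 tokens consumed by the prompt
```

Repeated tokens reuse the same ID, which is why common words are cheap: they map to a single, already-known vocabulary entry.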
Example
A logistics company uses Wabber's AI chatbot to quickly answer employee questions about shipping instructions and procedures. When an employee asks "How do I process a return shipment?", this question is converted into approximately 8 tokens. The system then retrieves relevant passages from the vector database and sends them as context, costing perhaps 500 tokens in total. Because Wabber intelligently selects the context, no unnecessary tokens are consumed and the employee receives an accurate answer within seconds.
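The request above can be expressed as a simple token budget. The context-window size and the room reserved for the answer below are assumptions for illustration, not Wabber's actual configuration.

```python
# Illustrative token budget for one RAG request (all numbers hypothetical).
CONTEXT_WINDOW = 8192        # assumed model context window
question_tokens = 8          # "How do I process a return shipment?"
context_tokens = 492         # retrieved passages sent alongside the question
reserve_for_answer = 1024    # room left for the model's output

used = question_tokens + context_tokens
remaining = CONTEXT_WINDOW - used - reserve_for_answer
print(f"input uses {used} tokens; {remaining} tokens still free")
```

Keeping the retrieved context tight, as the example describes, is what leaves room in the window for long answers and follow-up questions.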
Why are tokens important?
Tokens determine the capacity, speed, and cost of AI applications. The more tokens a model can process simultaneously, the more extensive the questions and documents it can handle. On Wabber's private cluster, tokens are processed locally without data being sent to external servers, which is essential for organizations working with confidential information. With 128GB of VRAM, Wabber can run models with large context windows, processing more information simultaneously for more accurate answers.
Frequently asked questions
How many tokens can an AI model process at once?
The number of tokens a model can process at once is called the context window. Modern models support context windows ranging from 4,000 to over 200,000 tokens. On Wabber's cluster, with 128GB of VRAM, models with large context windows can be run, allowing extensive documents and conversation histories to be processed at once.
What does token usage cost in AI?
With commercial cloud providers, tokens are charged per thousand or per million, with output tokens being more expensive than input tokens. On Wabber's private cluster, there are no per-token costs because processing takes place locally on our own hardware. This makes usage predictable and cost-effective, especially with intensive use.
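The per-token pricing model can be made concrete with a back-of-the-envelope calculation. The rates and monthly volumes below are invented for illustration and are not any provider's actual prices.

```python
# Hypothetical cloud pricing: output tokens cost more than input tokens.
input_rate = 3.00 / 1_000_000    # assumed $3 per million input tokens
output_rate = 15.00 / 1_000_000  # assumed $15 per million output tokens

monthly_input_tokens = 50_000_000
monthly_output_tokens = 10_000_000

cost = (monthly_input_tokens * input_rate
        + monthly_output_tokens * output_rate)
print(f"${cost:.2f} per month")  # $300.00 per month
```

On a flat-cost private cluster the same usage carries no marginal per-token fee, which is why heavy usage shifts the comparison in its favor.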
What is the difference between tokens and words?
A word can consist of one or more tokens. Short, common words are often a single token, while longer or rarer words are split into multiple tokens. As a rule of thumb, 1 token is approximately 0.75 words in English; the ratio is slightly lower in Dutch because long compound words are split into more tokens.
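The 0.75-words-per-token rule of thumb gives a quick way to estimate token counts from plain text. The ratio is approximate and language-dependent, so treat the result as an estimate, not an exact count.

```python
# Estimate token count from word count using the ~0.75 words-per-token
# rule of thumb. The ratio is approximate and varies by language.
def estimate_tokens(text, words_per_token=0.75):
    return round(len(text.split()) / words_per_token)

print(estimate_tokens("How do I process a return shipment"))  # 7 words -> ~9 tokens
```

For Dutch text a slightly lower `words_per_token` value would be appropriate, reflecting the longer compound words mentioned above.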
Is my data processed securely during tokenization?
On Wabber's private cluster, all tokens are processed locally on our own hardware in the Netherlands. No data leaves the cluster, guaranteeing complete privacy and data sovereignty. This is an important difference from cloud-based AI services where data is sent to external servers.