Forge AI Logo
⚡ Powered by Qwen3 30B

Zero-Cost AI Inference

Production-ready AI API endpoints with no per-request cost. Local LLM inference means your data stays private, responses are fast, and your bill stays at zero.

Request Beta Access
💰

Zero Cost

No per-token or per-request fees. Local inference on dedicated hardware means truly free API calls.

🔒

Privacy First

Your data never leaves the inference server. No third-party logging, no data retention.

Fast Inference

Optimized model serving on Apple Silicon. Low latency responses for real-time applications.

API Endpoints

Four powerful endpoints for common NLP tasks, all powered by Qwen3 30B.

POST /api/summarize

Condense long text into clear, concise summaries. Control output length with the max_length parameter.

Request
{
  "text": "Your long article or document...",
  "max_length": 150
}
Response
{
  "summary": "A concise summary of the input text...",
  "model": "qwen3-30b",
  "tokens_used": 87
}
POST /api/extract

Extract structured data — entities, key facts, or specific fields — from unstructured text.

Request
{
  "text": "John Smith signed the contract on March 5th for $50,000...",
  "fields": ["names", "dates", "amounts"]
}
Response
{
  "extracted": {
    "names": ["John Smith"],
    "dates": ["March 5th"],
    "amounts": ["$50,000"]
  }
}
POST /api/rewrite

Rewrite text in a different tone, style, or reading level. Perfect for content adaptation.

Request
{
  "text": "The quarterly earnings exceeded expectations...",
  "style": "casual",
  "language": "en"
}
Response
{
  "rewritten": "So the company absolutely crushed it this quarter...",
  "original_length": 48,
  "new_length": 52
}
POST /api/sentiment

Analyze the sentiment and emotional tone of text. Returns a score and breakdown.

Request
{
  "text": "This product is absolutely amazing, best purchase I've ever made!"
}
Response
{
  "sentiment": "positive",
  "score": 0.94,
  "emotions": {
    "joy": 0.87,
    "surprise": 0.12
  }
}

Try It Yourself

Copy these curl commands and start making requests. Replace the base URL and API key with your credentials.

CURL Summarize
curl -X POST https://<YOUR-API-URL>/api/summarize \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Paste your long text here...",
    "max_length": 100
  }'
CURL Extract
curl -X POST https://<YOUR-API-URL>/api/extract \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Extract entities from this text...",
    "fields": ["names", "dates", "locations"]
  }'
CURL Sentiment Analysis
curl -X POST https://<YOUR-API-URL>/api/sentiment \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "I love this product, it changed my life!"
  }'

Pricing

Simple, transparent, and currently free.

Beta Access

$0

Free during beta period

  • All 4 API endpoints
  • Up to 1,000 requests/day
  • Qwen3 30B model
  • No data retention
  • Bearer token authentication
  • Community support
Request Access

Get Started

Forge AI is currently in closed beta. Request access and we'll get you set up.

Send us an email with your use case and expected volume. We'll reply with your API key and base URL.

📧 Request Access via Email