DeepSeek-TNG-R1T2-Chimera is TNG Tech's second-generation Chimera text-generation model. Built from DeepSeek-AI’s R1-0528, R1, and V3-0324 checkpoints using Assembly-of-Experts merging, this 671B-parameter model combines strengths from all three. Its tri-parent design delivers strong reasoning ability while being about 20% faster than the original R1 and over twice as fast as R1-0528 on vLLM, providing a great balance of cost and performance. The model supports up to 60k-token input (tested up to ~130k) and stable <think> token behavior, making it ideal for long-context analysis, dialogue, and general text generation.
from openai import OpenAI
client = OpenAI(
base_url="https://api.naga.ac/v1",
api_key="YOUR_API_KEY",
)
resp = client.chat.completions.create(
model="deepseek-r1t2-chimera:free",
messages=[
{"role": "user", "content": "What's 2+2?"}
],
temperature=0.2,
)
print(resp.choices[0].message.content)