TNGTech

Browse models from TNGTech

1 model

DeepSeek-TNG-R1T2-Chimera is TNGTech's second-generation Chimera text-generation model. It is a 671B-parameter model built from three DeepSeek-AI checkpoints (R1-0528, R1, and V3-0324) using Assembly-of-Experts merging, and it combines strengths from all three parents. The tri-parent design delivers strong reasoning ability while running about 20% faster than the original R1 and more than twice as fast as R1-0528 on vLLM, giving a favorable balance of cost and performance. The model supports inputs of up to 60k tokens (tested up to ~130k) and shows stable <think> token behavior, making it well suited to long-context analysis, dialogue, and general text generation.

by TNGTech
Free
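
Since the description mentions stable <think> token behavior, a client may want to separate the reasoning trace from the final answer. The following is a minimal Python sketch, assuming the model wraps its reasoning in a single <think>...</think> block; the function name split_reasoning and the sample string are illustrative and not part of any TNGTech or DeepSeek API.

```python
import re

# Assumption: the reasoning trace is wrapped in one <think>...</think> block.
_THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(text: str) -> tuple[str, str]:
    """Return (reasoning, answer) extracted from a raw completion string."""
    match = _THINK_RE.search(text)
    if match is None:
        # No reasoning block found: treat the whole reply as the answer.
        return "", text.strip()
    reasoning = match.group(1).strip()
    # Everything outside the <think> block is treated as the answer.
    answer = (text[:match.start()] + text[match.end():]).strip()
    return reasoning, answer


# Hypothetical completion text, used only for illustration.
sample = "<think>2 + 2 is 4.</think>The answer is 4."
reasoning, answer = split_reasoning(sample)
print(reasoning)
print(answer)
```

If a reply contains no <think> block, the sketch returns an empty reasoning string and the full text as the answer, so callers do not need a separate code path.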