Trinity-Large-Thinking Benchmark Scores & Performance

BenchLM is tracking Trinity-Large-Thinking by Arcee AI. This profile is currently excluded from the public leaderboard because it does not yet have enough verified benchmark coverage to rank safely; only verified public benchmark rows appear below.

Trinity-Large-Thinking is an open-weight model with a 512K-token context window. It uses explicit chain-of-thought reasoning, which typically improves performance on math and complex reasoning tasks at the cost of higher latency and token usage.

This profile currently has verified scores for 9 of the 83 benchmarks BenchLM tracks. BenchLM only exposes verified benchmark rows publicly, so missing categories stay blank until a sourced evaluation is available.

Provider

Arcee AI

Source Type

Open Weight

Reasoning

Reasoning

Context Window

512K

Model Status

Tracked

Overall Score

Unranked

Pricing

$0.25 / $0.90

Input / output per 1M
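At the listed rates, per-request cost is simple arithmetic. A minimal sketch, assuming the listed $0.25 input / $0.90 output per-1M-token pricing; the helper name and example token counts are illustrative, and note that for a reasoning model the chain-of-thought tokens typically bill as output:

```python
# Estimate request cost from token counts at the listed per-1M-token rates.
INPUT_RATE_PER_M = 0.25   # USD per 1M input tokens (listed rate)
OUTPUT_RATE_PER_M = 0.90  # USD per 1M output tokens (listed rate)

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of a single request."""
    return (input_tokens * INPUT_RATE_PER_M
            + output_tokens * OUTPUT_RATE_PER_M) / 1_000_000

# Example: a 20K-token prompt with a 4K-token response (reasoning tokens
# count toward output, so chain-of-thought inflates the output side).
print(f"${request_cost(20_000, 4_000):.4f}")  # → $0.0086
```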

Runtime

N/A

Latency unavailable

Rankings Overview

BenchLM still lacks enough verified benchmark coverage to rank this model on the public leaderboard. Only verified public benchmark rows are shown below.

Knowledge Benchmarks

GPQA-D
76.3%

GPQA-D 2026 · Quarterly refresh · updated April 1, 2026

MMLU-Pro (Arcee)
83.4%

MMLU-Pro (Arcee) 2026 · Quarterly refresh · updated April 1, 2026

Coding Benchmarks

SWE-bench Verified*
63.2%

SWE-bench Verified* 2026 · Quarterly refresh · updated April 1, 2026

Mathematics Benchmarks

AIME25 (Arcee)
96.3%

AIME25 (Arcee) 2026 · Quarterly refresh · updated April 1, 2026

Agentic Benchmarks

Tau2-Airline
88.0%

Tau2-Airline 2026 · Quarterly refresh · updated April 1, 2026

Tau2-Telecom
94.7%

Tau2-Telecom 2026 · Quarterly refresh · updated April 1, 2026

PinchBench
91.9%

PinchBench 2026 · Quarterly refresh · updated April 1, 2026

BFCL v4
70.1%

BFCL v4 2026 · Quarterly refresh · updated April 1, 2026

Instruction Following Benchmarks

IFBench
52.3%

IFBench 2026 · Quarterly refresh · updated April 1, 2026

Frequently Asked Questions

How does Trinity-Large-Thinking perform overall in AI benchmarks?

Trinity-Large-Thinking has 9 verified benchmark scores on BenchLM, but it does not yet have enough coverage to receive a global overall rank.

Is Trinity-Large-Thinking good for knowledge and understanding?

Trinity-Large-Thinking has visible benchmark coverage in knowledge and understanding, but BenchLM does not currently assign it a global category rank there.

Is Trinity-Large-Thinking good for coding and programming?

Trinity-Large-Thinking has visible benchmark coverage in coding and programming, but BenchLM does not currently assign it a global category rank there.

Is Trinity-Large-Thinking good for mathematics?

Trinity-Large-Thinking has visible benchmark coverage in mathematics, but BenchLM does not currently assign it a global category rank there.

Is Trinity-Large-Thinking good for agentic tool use and computer tasks?

Trinity-Large-Thinking has visible benchmark coverage in agentic tool use and computer tasks, but BenchLM does not currently assign it a global category rank there.

Is Trinity-Large-Thinking good for instruction following?

Trinity-Large-Thinking has visible benchmark coverage in instruction following, but BenchLM does not currently assign it a global category rank there.

Is Trinity-Large-Thinking open source?

Trinity-Large-Thinking is an open-weight model created by Arcee AI: its weights can be downloaded and run locally or fine-tuned for specific use cases. Note that open weights are not necessarily the same as a fully open-source release, which would also cover training code and data.

Does Trinity-Large-Thinking have full benchmark coverage on BenchLM?

Not yet. Trinity-Large-Thinking currently has 9 verified benchmark scores out of the 83 benchmarks BenchLM tracks. BenchLM only exposes verified public benchmark rows, so missing categories stay blank until a sourced evaluation is available.

What is the context window size of Trinity-Large-Thinking?

Trinity-Large-Thinking has a context window of 512K tokens, which determines how much text it can process in a single interaction.
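As a rough illustration of what a 512K-token window means in practice, here is a minimal sketch, assuming K = 1024 and the common ~4 characters-per-token heuristic (actual counts depend on the model's tokenizer):

```python
# Rough check of whether a document fits in a 512K-token context window.
CONTEXT_WINDOW_TOKENS = 512 * 1024  # 512K tokens (assuming K = 1024)
CHARS_PER_TOKEN = 4                 # common heuristic for English text, not exact

def fits_in_context(text: str) -> bool:
    """Estimate whether `text` fits in the context window."""
    estimated_tokens = len(text) / CHARS_PER_TOKEN
    return estimated_tokens <= CONTEXT_WINDOW_TOKENS

# ~2M characters ≈ 500K tokens, just under the window.
print(fits_in_context("x" * 2_000_000))  # → True
print(fits_in_context("x" * 3_000_000))  # → False
```

For real workloads, the model's own tokenizer should be used instead of the heuristic, and the budget must also leave room for the generated output and any chain-of-thought tokens.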

Last updated: April 1, 2026 · Runtime metrics stay blank until BenchLM has a sourced snapshot.

Weekly LLM Updates

New model releases, benchmark scores, and leaderboard changes. Every Friday.

Free. Your signup is stored with a derived country code for compliance routing.