I'm experimenting with the Factuality scorer, and with some inputs I see JSON parsing failures caused by unterminated strings, e.g.:
FactualityAccuracyScorer: |-
SyntaxError: Unterminated string in JSON at position 2196
at JSON.parse (<anonymous>)
at parseResponse
This failure seems deterministic in my limited testing. But when I increase the hardcoded max token limit of 512 here, the failure goes away, which suggests the parsing error is caused by the model's JSON output being truncated when it hits the max token limit.
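The failure mode is easy to reproduce in isolation: cutting a well-formed JSON string short at an arbitrary position, as a token limit would, leaves an unterminated string that `JSON.parse` rejects. A minimal sketch (the payload shape here is illustrative, not the scorer's actual output):

```javascript
// Simulate a model response that gets cut off by a max-token limit
// in the middle of a string value.
const fullResponse = JSON.stringify({
  choice: "A",
  rationale: "The submitted answer is factually consistent with the expert answer.",
});

// Truncate mid-string, as a token limit would.
const truncated = fullResponse.slice(0, 40);

let error = null;
try {
  JSON.parse(truncated);
} catch (e) {
  // SyntaxError: in modern V8 the message reads
  // "Unterminated string in JSON at position ..."
  error = e;
}
```

The exact message text varies by JavaScript engine version, but the thrown error is a `SyntaxError` either way, matching the stack trace above.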
Could we make the max token limit something that callers can override, the same way they can override the model used for inference here?
Alternatively, I see this pending PR from March, authored by @ankrgyl and approved by @ibolmo, that simply removes the max token count: #120. Would that be the preferred approach? (I'm not sure why that PR was approved but never merged: did it turn out to cause more problems than it fixed?)
Let me know if you'd like me to send a PR adding a maxTokens argument to LLMClassifierFromTemplate and modelGradedSpecSchema so that callers can override this max token setting: I'd be happy to send a change.
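For concreteness, a hypothetical sketch of the option handling I have in mind; the `maxTokens` parameter and `resolveMaxTokens` helper do not exist in the library today and the names are mine:

```javascript
// Proposed (hypothetical) option resolution: keep the current 512 as the
// default, but let callers raise it when their prompts produce longer JSON.
const DEFAULT_MAX_TOKENS = 512; // the value currently hardcoded in the scorer

function resolveMaxTokens(options = {}) {
  // Fall back to the existing default when the caller does not override it.
  return options.maxTokens ?? DEFAULT_MAX_TOKENS;
}

const defaultLimit = resolveMaxTokens({});
const raisedLimit = resolveMaxTokens({ maxTokens: 1024 });
```

The idea is that existing callers see no behavior change, while callers hitting truncation can raise the limit without forking the scorer.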