testing framework for transformRequest+Response #72

knjiang · 2026-01-29T02:45:53Z

This PR adds an offline testing framework for transformRequest and transformResponse: lingua transformations are tested against saved snapshots, so the tests themselves make no API calls.

We use the existing cases and the WASM bindings generated in the previous PR (transform_request, transform_response, and validate_*_json) to verify that transforms are valid and that there are no regressions.

At a high level, my mental model is:

coverage-report ensures internal consistency between the Universal model and all the providers.
transforms ensures external compatibility by running OpenAPI schema validation on every transform and by calling the actual SDK with the transformed request.

Diagram:

  Phase 1: Capture (one-time, requires API keys)                                                                                
                                                                                                                                
  getCaseForProvider(caseName, source)                                                                                          
             │                                                                                                                  
             ▼                                                                                                                  
  ┌─────────────────────────────────────┐                                                                                       
  │   transformAndValidateRequest()     │                                                                                       
  │   • transform_request() ────────────┼──► validate_*_request()                                                               
  └──────────────────┬──────────────────┘                                                                                       
                     │                                                                                                          
                     ▼                                                                                                          
  ┌─────────────────────────────────────┐                                                                                       
  │   callProvider(target, request)     │                                                                                       
  │   • openai.chat.completions.create()│                                                                                       
  │   • anthropic.messages.create()     │                                                                                       
  └──────────────────┬──────────────────┘                                                                                       
                     │                                                                                                          
                     ▼                                                                                                          
           validate_*_response()                                                                                                
                     │                                                                                                          
                     ▼                                                                                                          
           writeFileSync(path, response)                                                                                        
           transforms/{src}_to_{tgt}/{case}.json                                                                                
                                                                                                                                
  Phase 2: Test (CI, no API calls)                                                                                              
                                                                                                                                
  getCaseForProvider(caseName, source)                                                                                          
             │                                                                                                                  
             ▼                                                                                                                  
  ┌─────────────────────────────────────┐                                                                                       
  │   transformAndValidateRequest()     │                                                                                       
  │   • transform_request() ────────────┼──► validate_*_request()                                                               
  └──────────────────┬──────────────────┘                                                                                       
                     │                                                                                                          
                     ▼                                                                                                          
             toMatchSnapshot("request")                                                                                         
                     │                                                                                                          
                     ▼                                                                                                          
  ┌─────────────────────────────────────┐                                                                                       
  │   loadAndValidateResponse()         │                                                                                       
  │   • readFileSync(path) ─────────────┼──► validate_*_response()                                                              
  └──────────────────┬──────────────────┘                                                                                       
                     │                                                                                                          
                     ▼                                                                                                          
  ┌─────────────────────────────────────┐                                                                                       
  │   transformResponseData()           │                                                                                       
  │   • transform_response() ───────────┼──► validate_*_response()                                                              
  └──────────────────┬──────────────────┘                                                                                       
                     │                                                                                                          
                     ▼                                                                                                          
             toMatchSnapshot("response")
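
For readers who prefer code to boxes, Phase 2 can be sketched roughly as below. This is an illustration only: `TransformDeps`, `runOfflineCase`, and every signature in it are hypothetical stand-ins for the helpers named in the diagram, not the actual bindings, and the dependencies are injected so the sketch runs without the WASM module.

```typescript
// Hypothetical stand-ins for the helpers in the diagram. The real
// signatures in the lingua WASM bindings may differ.
interface TransformDeps {
  transformRequest: (payload: unknown, target: string) => unknown;
  validateRequest: (payload: unknown, provider: string) => void; // throws if invalid
  loadResponse: (path: string) => unknown;
  validateResponse: (payload: unknown, provider: string) => void; // throws if invalid
  transformResponse: (payload: unknown, source: string) => unknown;
}

interface OfflineResult {
  request: unknown;
  response: unknown;
}

// Phase 2: no network access. The saved provider response stands in for the
// live call made once during Phase 1.
function runOfflineCase(
  deps: TransformDeps,
  sourcePayload: unknown,
  source: string,
  target: string,
  savedResponsePath: string,
): OfflineResult {
  // 1. Transform the source-format request and validate it for the target.
  const request = deps.transformRequest(sourcePayload, target);
  deps.validateRequest(request, target);

  // 2. Load the captured target-format response and validate it.
  const saved = deps.loadResponse(savedResponsePath);
  deps.validateResponse(saved, target);

  // 3. Transform the response back to the source format and validate again.
  const response = deps.transformResponse(saved, source);
  deps.validateResponse(response, source);

  // The caller would snapshot both values, e.g. with toMatchSnapshot().
  return { request, response };
}
```

In a test runner, the two returned values would be the ones passed to `toMatchSnapshot("request")` and `toMatchSnapshot("response")`.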

knjiang · 2026-01-29T02:46:12Z

Warning

This pull request is not mergeable via GitHub because a downstack PR is open. Once all requirements are satisfied, merge this PR as a stack on Graphite.

testing framework for transformRequest+Response #72 👈 (View in Graphite)
lingua-wasm bindings for request/response #69
Add universal param configs #61 : 1 other dependent PR (#70 )
add anthropic messages parameter test cases #59
add chat completion parameter test cases #58
add openai responses parameter test cases #54
main

This stack of pull requests is managed by Graphite. Learn more about stacking.

knjiang · 2026-01-29T02:47:36Z

payloads/transforms/responses_to_anthropic/reasoningRequestTruncated.json

+{
+  "error": "400 {\"type\":\"error\",\"error\":{\"type\":\"invalid_request_error\",\"message\":\"`max_tokens` must be greater than `thinking.budget_tokens`. Please consult our documentation at https://docs.claude.com/en/docs/build-with-claude/extended-thinking#max-tokens-and-context-window-size\"},\"request_id\":\"req_011CXasVyLu26rs4f6bS7DRJ\"}",
+  "name": "Error"
+}


This is the only error I've found so far. It seems valid, since our case sets max_tokens = 100 with high reasoning.

I don't know if we want to throw our own error here or something. https://github.com/braintrustdata/lingua/blob/main/payloads/cases/simple.ts#L146
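
If we did want to raise our own error before the request reaches Anthropic, a pre-flight check might look something like the sketch below. The `ThinkingRequest` shape and the `checkThinkingBudget` name are hypothetical, only the rule itself (max_tokens must be greater than thinking.budget_tokens) comes from the API error above, and the budget value in the usage note is an assumed number, not what our case actually maps to.

```typescript
// Hypothetical minimal request shape: only the two fields named in the
// Anthropic error message are modelled here.
interface ThinkingRequest {
  max_tokens: number;
  thinking?: { budget_tokens: number };
}

// Returns a description of the problem, or null when the request passes.
function checkThinkingBudget(req: ThinkingRequest): string | null {
  const budget = req.thinking?.budget_tokens;
  if (budget === undefined) {
    return null; // thinking not configured, nothing to check
  }
  if (req.max_tokens <= budget) {
    return `max_tokens (${req.max_tokens}) must be greater than thinking.budget_tokens (${budget})`;
  }
  return null;
}
```

For example, `checkThinkingBudget({ max_tokens: 100, thinking: { budget_tokens: 1024 } })` returns a message, while the same request with `max_tokens: 2048` returns `null`.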

knjiang · 2026-01-29T04:10:29Z

crates/lingua/src/universal/response.rs

+                map.insert(
+                    "input_tokens_details".into(),
+                    serde_json::json!({ "cached_tokens": self.prompt_cached_tokens.unwrap_or(0) }),
+                );


Caught this via the validate_response_json JS binding: Responses requires input_tokens_details and output_tokens_details.
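
To make the shape concrete, here is the same defaulting idea as the Rust change, sketched in TypeScript. The `UniversalUsageSketch` field names and the `toResponsesUsage` helper are hypothetical and do not mirror the real universal model. The `reasoning_tokens` key inside `output_tokens_details` is my understanding of the Responses usage object, not something this diff shows.

```typescript
// Hypothetical, simplified view of universal usage. All fields are optional
// because not every provider reports them.
interface UniversalUsageSketch {
  promptTokens?: number;
  completionTokens?: number;
  promptCachedTokens?: number;
  completionReasoningTokens?: number;
}

// Always emit both *_details objects, defaulting missing counts to 0,
// so that schema validation does not fail on an absent required field.
function toResponsesUsage(u: UniversalUsageSketch) {
  const input = u.promptTokens ?? 0;
  const output = u.completionTokens ?? 0;
  return {
    input_tokens: input,
    input_tokens_details: { cached_tokens: u.promptCachedTokens ?? 0 },
    output_tokens: output,
    output_tokens_details: { reasoning_tokens: u.completionReasoningTokens ?? 0 },
    total_tokens: input + output,
  };
}
```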

knjiang · 2026-01-29T04:34:34Z

payloads/scripts/transforms/lingua-capture.ts

+}
+/* eslint-enable @typescript-eslint/consistent-type-assertions */
+
+const isParamCase = (name: string) => name.endsWith("Param");


Temporary: I didn't want to blow up the diff, so I'm doing this here.
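
The hunk above does not show how the predicate is consumed. One plausible use, sketched here purely as an assumption, is to split case names into parameter cases and everything else. `partitionCases` is a hypothetical helper, and `temperatureParam` is a made-up case name.

```typescript
// Same predicate as in the diff: a case counts as a "param case" when its
// name ends with the literal suffix "Param".
const isParamCase = (name: string) => name.endsWith("Param");

// Hypothetical helper: split case names into param cases and the rest,
// preserving the original order within each group.
function partitionCases(names: string[]): { param: string[]; other: string[] } {
  const param: string[] = [];
  const other: string[] = [];
  for (const name of names) {
    (isParamCase(name) ? param : other).push(name);
  }
  return { param, other };
}
```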

knjiang · 2026-01-29T05:05:56Z

payloads/transforms/anthropic_to_chat-completions/complexReasoningRequest.json

@@ -0,0 +1,35 @@
+{


anthropic_to_chat-completions means an Anthropic payload sent to a chat completions model. We save the actual chat completion response payload so the tests don't have to incur the LLM cost.
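
The naming convention can be written down as a small helper. This is a sketch: `snapshotPath` and `ProviderName` are hypothetical, and the three provider names are simply the ones that appear in the paths in this PR.

```typescript
// Provider identifiers as they appear in the snapshot directory names in
// this PR. The real list may be longer.
type ProviderName = "anthropic" | "chat-completions" | "responses";

// Builds payloads/transforms/{src}_to_{tgt}/{case}.json, where src is the
// format of the original payload and tgt is the provider that was called.
function snapshotPath(source: ProviderName, target: ProviderName, caseName: string): string {
  return `payloads/transforms/${source}_to_${target}/${caseName}.json`;
}
```

With these inputs, `snapshotPath("anthropic", "chat-completions", "complexReasoningRequest")` yields the path of the file this comment is attached to.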

knjiang mentioned this pull request Jan 29, 2026

lingua-wasm bindings for request/response #69

Open

knjiang force-pushed the 01-28-testing_framework_for_transformrequest_response branch from 0cc799d to 6093a42 Compare January 29, 2026 02:50

knjiang force-pushed the 01-27-request_typescript_and_python_bindings branch 2 times, most recently from 9e4d458 to e383b62 Compare January 29, 2026 04:07

knjiang force-pushed the 01-28-testing_framework_for_transformrequest_response branch from 6093a42 to 5c74eae Compare January 29, 2026 04:07

knjiang marked this pull request as ready for review January 29, 2026 05:04

testing framework for transformRequest+Response

0c166a9

knjiang force-pushed the 01-28-testing_framework_for_transformrequest_response branch from 5c74eae to 0c166a9 Compare January 29, 2026 05:38

knjiang force-pushed the 01-27-request_typescript_and_python_bindings branch from e383b62 to 8d60e4b Compare January 29, 2026 05:38

knjiang requested review from ankrgyl and remh January 30, 2026 20:00
