CoqPilot refinement: Grazie bug fix, new limiting context model parameters, unsafe code refactor #55

GlebSolovev · 2025-01-17T12:21:57Z

🐞 Fixed a GrazieService bug. The service generated `choices + 1` proofs instead of the requested `choices`, because a loop was not updated correctly during an earlier refactor.
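
For illustration only, here is a minimal sketch of how such an off-by-one can arise and what the fix looks like. The names `generateProofsBuggy`, `generateProofsFixed`, and `GenerateOne` are hypothetical and are not the actual GrazieService code; the inclusive loop bound is just one plausible shape of the bug.

```typescript
// Hypothetical stand-in for a single completion request (not the real GrazieService API).
type GenerateOne = (attempt: number) => string;

// Buggy shape: the inclusive upper bound runs the body choices + 1 times.
function generateProofsBuggy(choices: number, generateOne: GenerateOne): string[] {
    const proofs: string[] = [];
    for (let i = 0; i <= choices; i++) {
        proofs.push(generateOne(i));
    }
    return proofs;
}

// Fixed shape: the exclusive upper bound runs the body exactly choices times.
function generateProofsFixed(choices: number, generateOne: GenerateOne): string[] {
    const proofs: string[] = [];
    for (let i = 0; i < choices; i++) {
        proofs.push(generateOne(i));
    }
    return proofs;
}
```

A test that asserts the number of returned proofs equals `choices` is enough to catch a regression of this kind.
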
🆕 Added and tested new model parameters. Introduced maxContextTheoremsNumber and multiroundProfile.maxPreviousProofVersionsNumber to better control the context sent to LLMs.
- Previously, users could only limit the context by adjusting tokensToGenerate, but these new parameters provide a more intuitive and precise way to manage it.
- These additions are essential for experiments exploring how premises influence CoqPilot's proof generation capabilities.
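
As a rough sketch of what these two limits could mean in practice, the helpers below truncate a list of context theorems and a list of previous proof versions. Only the parameter names `maxContextTheoremsNumber` and `maxPreviousProofVersionsNumber` come from this PR. The helper functions are hypothetical, as are the policies they use (keep the first theorems, assuming they are already ranked, and keep the most recent proof versions); they may differ from what CoqPilot actually does.

```typescript
// Hypothetical helper: keep at most `maxContextTheoremsNumber` theorems.
// Assumes the input is already ordered by relevance; `undefined` means "no limit".
function limitContextTheorems<T>(theorems: T[], maxContextTheoremsNumber?: number): T[] {
    if (maxContextTheoremsNumber === undefined) {
        return theorems;
    }
    return theorems.slice(0, Math.max(0, maxContextTheoremsNumber));
}

// Hypothetical helper: keep only the most recent proof versions.
// Assumes `versions` is ordered from oldest to newest; `undefined` means "no limit".
function limitPreviousProofVersions<T>(versions: T[], maxPreviousProofVersionsNumber?: number): T[] {
    if (maxPreviousProofVersionsNumber === undefined) {
        return versions;
    }
    if (maxPreviousProofVersionsNumber <= 0) {
        // Handled separately because slice(-0) would return the whole array.
        return [];
    }
    return versions.slice(-maxPreviousProofVersionsNumber);
}
```

Limiting by item count rather than by token budget makes the size of the context directly predictable, which is what experiments on premise influence need.
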
💂‍♂️ Refactored potentially unsafe code:
- In the project's early stages, we (or at least I) sometimes used `as` type casts in misleading ways, without realizing that they are checked only at compile time and have no effect at runtime. These cases have now been reviewed: each misleading cast was either replaced with a proper runtime check or refactored away entirely.
- Updated constructors for all custom errors to ensure consistent and reliable typing in all scenarios, following the approach established in the last benchmarking framework refactor.
- Revised and refactored error-throwing code: replaced most generic `Error`s with more specific ones, using the previously introduced error-utils wrappers.
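
To illustrate the difference between an `as` cast and a runtime check, and what a specific error with a carefully written constructor can look like, here is a minimal sketch. `ProofResult`, `isProofResult`, `InvalidResponseError`, and `parseProofResult` are hypothetical names, not CoqPilot's actual types or its error-utils wrappers. Restoring the prototype in the constructor is one common pattern for keeping `instanceof` reliable and is an assumption here, since the PR does not spell out the approach it uses.

```typescript
// Hypothetical payload shape, used only for this example.
interface ProofResult {
    proof: string;
}

// Type guard: unlike `value as ProofResult`, this inspects the value at runtime.
function isProofResult(value: unknown): value is ProofResult {
    if (typeof value !== "object" || value === null) {
        return false;
    }
    // This cast is safe: the checks above established a non-null object.
    return typeof (value as Record<string, unknown>).proof === "string";
}

// Hypothetical specific error used instead of a generic Error.
class InvalidResponseError extends Error {
    constructor(message: string) {
        super(message);
        // Restore the prototype chain so `instanceof` also works on older compile targets.
        Object.setPrototypeOf(this, new.target.prototype);
        this.name = "InvalidResponseError";
    }
}

function parseProofResult(raw: string): ProofResult {
    const parsed: unknown = JSON.parse(raw);
    // `parsed as ProofResult` would compile here but verify nothing at runtime.
    if (!isProofResult(parsed)) {
        throw new InvalidResponseError(`unexpected response shape: ${raw}`);
    }
    return parsed;
}
```

Callers can then distinguish a malformed response from other failures with `error instanceof InvalidResponseError` instead of matching on message text.
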

Also refactor some of them

GlebSolovev · 2025-01-29T13:03:43Z

GlebSolovev added 7 commits January 17, 2025 05:28

Fix GrazieService generating choices + 1 proofs bug

db3f0d3

Support maxContextTheoremsNumber parameter

dc0f37b

Update test with maxContextTheoremsNumber

be12184


Update package.json and README with maxContextTheoremsNumber

e24436f

Support maxPreviousProofVersionsNumber parameter

3874f9b

Refactor: fix misleading as casts

a521735

Refactor: refine all custom errors ctors

6e2e24a

GlebSolovev self-assigned this Jan 17, 2025

GlebSolovev added 2 commits January 29, 2025 12:46

Refactor error-throwing in main code

2cce131

Fix CoqLspClient throwing unwrapped error

26b3d8c

Base automatically changed from benchmarking-multiround to v2.5.0-dev January 29, 2025 11:58

GlebSolovev merged commit 6351238 into v2.5.0-dev Jan 29, 2025
3 checks passed

GlebSolovev deleted the coqpilot-refinement branch January 29, 2025 13:04

K-dizzled self-requested a review January 29, 2025 19:21
