Embeddings chunking fixes #6261

Draft
wants to merge 4 commits into
base: main


RomneyDa
Collaborator

Description

[ What changed? Feel free to be brief. ]

Checklist

  • [] I've read the contributing guide
  • [] The relevant docs, if any, have been updated or created
  • [] The relevant tests, if any, have been updated or created

Screenshots

[ For visual changes, include screenshots. Screen recordings are particularly helpful, and appreciated! ]

Tests

[ What tests were added or updated to ensure the changes work as expected? ]

@RomneyDa RomneyDa requested a review from a team as a code owner June 23, 2025 16:30
@RomneyDa RomneyDa requested review from sestinj and removed request for a team June 23, 2025 16:30
netlify bot commented Jun 23, 2025

Deploy Preview for continuedev canceled.

Name Link
🔨 Latest commit 20a6a54
🔍 Latest deploy log https://app.netlify.com/projects/continuedev/deploys/68598119e5b55d0008631e5f

@dosubot dosubot bot added the size:XL This PR changes 500-999 lines, ignoring generated files. label Jun 23, 2025
// `cross-env` seems to make it so __dirname is the root of the project and not the directory containing this file
return path.join(__dirname, "llm", workerFileName);
}
// if (process.env.NODE_ENV === "test") {
Commenting out the test environment path handling will break the worker file resolution during tests. The code in build.js and package.json files shows that worker files are moved to different locations during build and testing. Without the test-specific path logic, the tokenizer workers won't be found in the test environment, causing the LlamaAsyncEncoder and GPTAsyncEncoder to fail.
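To make the concern concrete, here is a minimal sketch of environment-dependent worker path resolution. It is not the project's actual code: `resolveWorkerPath`, its parameters, and the file name `exampleWorker.mjs` are hypothetical, and the directory layout (worker under `llm` during tests, next to the bundle otherwise) is an assumption inferred from the snippet above.

```typescript
import * as path from "node:path";

// Hypothetical helper (not from the PR): chooses where to look for a
// tokenizer worker file based on the current environment.
function resolveWorkerPath(
  workerFileName: string,
  baseDir: string,
  nodeEnv: string | undefined,
): string {
  if (nodeEnv === "test") {
    // Assumption: under tests, baseDir is the project root,
    // so the worker sits in an "llm" subdirectory.
    return path.join(baseDir, "llm", workerFileName);
  }
  // Assumption: in a build, the worker is copied next to the bundled file.
  return path.join(baseDir, workerFileName);
}

// Usage: the same file name resolves to different locations per environment.
const underTest = resolveWorkerPath("exampleWorker.mjs", "/repo", "test");
const inBuild = resolveWorkerPath("exampleWorker.mjs", "/repo/out", "production");
console.log(underTest, inBuild);
```

If the `"test"` branch is removed, as with the commented-out check, every environment falls through to a single location, which is the failure mode the comment describes.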


React with 👍 to tell me that this comment was useful, or 👎 if not (and I'll stop posting more comments like this in the future)

recurseml bot commented Jun 23, 2025

😱 Found 1 issue. Time to roll up your sleeves! 😱

@RomneyDa RomneyDa marked this pull request as draft June 23, 2025 17:58