Skip to content

Conversation

platypii
Copy link

@platypii platypii commented Dec 9, 2024

Uses hyparquet for javascript parquet parsing. It is a small, pure js implementation of parquet parsing with no dependencies. Parquet.js that this replaces is unmaintained and has not been updated in 5+ years.

Fixes #102 and #104 by using a well-maintained parquet library that supports modern parquet files.

I tested this with the parquet file generated by together-python and confirmed that upload works and fixes issue #104.

Let me know if I can help with anything!

@platypii
Copy link
Author

@Nutlope? Anyone?

@nicolasembleton
Copy link

This should be straightforward to review.

@platypii
Copy link
Author

platypii commented Jan 2, 2025

@samselikoff?

@platypii
Copy link
Author

Here's a video showing that instructions on together.ai website fail due to parquetjs parsing error. This PR fixes this issue:

together-upload.mp4

@Nutlope @samselikoff anything I can do to help move this along?

@Nutlope
Copy link
Collaborator

Nutlope commented Jan 24, 2025

@platypii thanks so much for reporting and for the PR! We're in the process of fixing some things with the upload. @yogishbaliga, mind taking at this PR when you do your other PR on the upload functionality too?

@platypii
Copy link
Author

@yogishbaliga thoughts? happy to contribute if there's more work that needs done

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Replace parquetjs for better deno compatibility?
3 participants