Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Only fetches a subset of statuses #3

Open
andypiper opened this issue Jan 2, 2024 · 3 comments
Open

Only fetches a subset of statuses #3

andypiper opened this issue Jan 2, 2024 · 3 comments

Comments

@andypiper
Copy link

Running this against my account, I have a total of 6402 statuses according to the web UI (as of this morning), but the database only has 4368 rows in it. Investigating...

@olithissen
Copy link
Owner

Interesting. I remember something like "statuses" not actually being "posts" but "interactions" in general and might also include favourites and reblogs.

@andypiper
Copy link
Author

andypiper commented Jan 2, 2024

You could be right here (and, given I do developer relations at Mastodon, I should probably know the answer... 😬 ... let me go ask the eng team!)

I've modified locally to exclude reblogs now (avoiding the null rows), and also have a regex to simplify the post content values to strip the markup. Not entirely sure where you are going with this, but happy to contribute if useful.

@andypiper
Copy link
Author

... although I suppose, now that I'm discovering the different output formats the DuckDB CLI supports, there might be value in retaining the markup, for example to render subsets of query results into HTML. 🤔

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants