migration ignores documents already in DB #445
Merged
Modifies `fetch_sheet` to skip documents that already exist in the database. To determine whether a document exists, this function uses the query `document_id_from_name` to compare the short name from a metadata sheet with all short names in the database. If there is a match, it skips the rest of the spreadsheet. Otherwise, it adds the sheet as normal.

This is an imprecise solution meant to facilitate the creation of a new edited collection in the short term. One major limitation is that it cannot add new words or annotation layers to an existing document. In the future, we will likely want more granular comparisons so that new information in sheets can be merged with existing information in the database.
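For reviewers who want the control flow at a glance, here is a minimal sketch of the skip-if-exists check described above. It is not the code in this PR. `InMemoryDatabase`, `insert_document`, the `short_name` key, and the signatures of `fetch_sheet` and `document_id_from_name` are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class InMemoryDatabase:
    """Hypothetical stand-in for the real database, keyed by short name."""

    documents: Dict[str, int] = field(default_factory=dict)

    def document_id_from_name(self, short_name: str) -> Optional[int]:
        # Assumed behavior: return the document id, or None if no match.
        return self.documents.get(short_name)

    def insert_document(self, short_name: str) -> int:
        # Hypothetical helper: store the document and return its new id.
        new_id = len(self.documents) + 1
        self.documents[short_name] = new_id
        return new_id


def fetch_sheet(db: InMemoryDatabase, sheet_metadata: Dict[str, str]) -> Optional[int]:
    """Add the sheet's document unless its short name is already in the database."""
    short_name = sheet_metadata["short_name"]
    if db.document_id_from_name(short_name) is not None:
        # Match found: skip the rest of the spreadsheet entirely.
        return None
    # No match: add the sheet as normal.
    return db.insert_document(short_name)
```

Because the check is all-or-nothing on the short name, a second call for the same document returns `None` without touching the stored data, which is exactly the limitation noted above: new words or annotation layers for an existing document are never merged.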