Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added note on available datatypes for DP1 that we will handle #5

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

stvoutsin
Copy link
Member

No description provided.

@stvoutsin stvoutsin requested a review from gpdf April 3, 2025 00:17
Note that more complex types like arrays of numerical types are not on our schedule for the initial release (DP1), but we may decide to
extend our capabilities for handling these types if it becomes a requirement for future releases.


Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How about timestamps? When I was looking into the Parquet files generated by the pipelines, I found 5 columns where the timestamps were encoded:

  • Table Visit has two columns: expMidpt and obsStart
  • Table CcdVisit has two columns: expMidpt and obsStart
  • Table DiaSource has one column time_processed

The Parquet low-level schema for the columns is:

% parq diaSourceTable_tract_10463_lsst_cells_v1_LSSTComCam_runs_DRP_DP1_w_2025_09_DM-49235_20250303T090128Z.parq --schema  | grep Timestamp

  optional int64 field_id=-1 time_processed (Timestamp(isAdjustedToUTC=false, timeUnit=nanoseconds, is_from_converted_type=false, force_set_converted_type=false));

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Forgot to mention table ObsCore. It has a couple of interesting SQL types:

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I should maybe clarify this in the document, but the types I was referring to are what the IVOA datatypes, i..e what appear in the VOTable & TAP_SCHEMA.

For timestamps for example, we currently output as a char (). So Russ will have to format whatever the encoding is in QServ into a char(). Similar for the Obscore table, we map those complicated SQL types into varchars.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants