You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have searched the existing issues, and I could not find an existing issue for this feature
I am requesting a straightforward extension of existing dbt-bigquery functionality, rather than a Big Idea better suited to a discussion
Issue Overview
Following this discussion, I am raising this issue to address the lack of descriptions in the generated source.yml file.
Current Behaviour
When using codgen to generate a source.yml file with the list of columns and tables in my datasource, the description field is empty for both tables and columns.
Expected Behaviour
The descriptions of columns and tables in BigQuery should be parsed automatically and included in the generated source.yml.
Why This Relates to dbt-bigquery and Not codegen
The codegen package cannot include descriptions because the description field is not retrieved by dbt. This appears to be due to the BigqueryColumn class not having a description attribute.
Proposed Solution
When calling the BigQuery API in the get_table function from the Bigquery Client API here, retrieve the table and column descriptions from the returned table object. This will make the descriptions available in the Relation class for use by codegen.
Steps to Reproduce
Set up a BigQuery datasource with tables and columns that have descriptions.
Run dbt codegen generate to create a source.yml file.
Write a custom script that queries the table's schema from GCP BigQuery API and parse the returned result in source.yml file -> This solution requires too much verbose and non reusable code
Use the INFORMATION_SCHEMA as suggested here ->But this requires more permissions and it is very repetitive to write the name of every column
Who will this benefit?
Every user of dbt-BigQuery working with very large data sources, such as SAP tables with hundreds of columns. Personnaly I had to do a migration of SAP tables to DBT it was a very repetitive work to copy all the tables, with their columns in the source.yml file.
Are you interested in contributing this feature?
Yes
Anything else?
No response
The text was updated successfully, but these errors were encountered:
Is this your first time submitting a feature request?
Issue Overview
Following this discussion, I am raising this issue to address the lack of descriptions in the generated
source.yml
file.Current Behaviour
When using codgen to generate a
source.yml
file with the list of columns and tables in my datasource, thedescription
field is empty for both tables and columns.Expected Behaviour
The descriptions of columns and tables in BigQuery should be parsed automatically and included in the generated
source.yml
.Why This Relates to
dbt-bigquery
and Notcodegen
The
codegen
package cannot include descriptions because thedescription
field is not retrieved by dbt. This appears to be due to the BigqueryColumn class not having adescription
attribute.Proposed Solution
When calling the BigQuery API in the get_table function from the Bigquery Client API here, retrieve the table and column descriptions from the returned
table
object. This will make the descriptions available in theRelation
class for use bycodegen
.Steps to Reproduce
dbt codegen generate
to create asource.yml
file.description
fields are empty.Expected Output
An example of the expected
source.yml
file:Describe alternatives you've considered
There are two alternatives to this issue:
Who will this benefit?
Every user of dbt-BigQuery working with very large data sources, such as SAP tables with hundreds of columns. Personnaly I had to do a migration of SAP tables to DBT it was a very repetitive work to copy all the tables, with their columns in the source.yml file.
Are you interested in contributing this feature?
Yes
Anything else?
No response
The text was updated successfully, but these errors were encountered: