portals.qmd (+62 -120)
@@ -1,6 +1,4 @@
----
-title: "Portals"
----
+# Portals

 ## Overview

@@ -11,85 +9,63 @@ CalCOFI data is available through various portals, each serving different purpos
 While it would be ideal for CalCOFI data to be available through a single portal, each portal has its strengths and limitations. The following diagram illustrates one possible realization of data flow between CalCOFI data and the portals: from raw data to the integrated database to portals and meta-portals.
-style meta fill:#F8FAFC,stroke:#CBD5E1,stroke-width:2px
+%%| label: fig-portals-flow
+%%| fig-cap: "Flow of data from raw to database to portals and meta-portals."
+%%| file: diagrams/portals_flow.mmd
+%%| fig-width: 6
 ```

 In practice, CalCOFI is a partnership with various contributing members, so the authoritative dataset might flow differently, such as from EDI to the database to the other portals. The other portals, such as OBIS or ERDDAP, serve different audiences or purposes. The meta-portals like ODIS and Data.gov then index these portals to provide broader discovery of CalCOFI datasets.

 ## Portals

-| Portal | Full Archive | Versioning | DOI issued | Query by xyt | Query by taxa | Multiple formats | API Access |
-† Capability: :red_circle: none; :yellow_circle: partial; :green_circle: full
+While some portals serve as data repositories, others provide advanced data access and visualization tools. The following sections describe the main portals where CalCOFI data is available and their key features.
+
+```{r}
+#| label: tbl-portals
+#| tbl-cap: Portal Capabilities.
+
+librarian::shelf(
+  dplyr, gt, here, readr, scales)
+
+portals_csv <- here("data/portal_comparison.csv")
+
+# Define the capability symbols and colors
+cap_symbols <- c(
+  "none" = "\u2716", # ✖
+  "partial" = "\u25B2", # ▲
+  "full" = "\u2714") # ✔
+cap_colors <- c(
+  "none" = "#FFCCCC", # Light red
+  "partial" = "#FFF2CC", # Light yellow
+  "full" = "#CCFFCC") # Light green
+
+# Create the gt table
+read_csv(portals_csv, show_col_types = F) |>
+  gt(rowname_col = "Portal") |>
+  # Apply background colors based on capability levels
@@ -147,54 +123,20 @@ In practice, CalCOFI is a partnership with various contributing members, so the
 The [Ecological Metadata Language (EML)](https://eml.ecoinformatics.org/) (and using R package [EML](https://docs.ropensci.org/EML/) in workflows) serves as a key standard for describing ecological and environmental data. For CalCOFI, EML metadata files are generated alongside data files, providing structured documentation that enables interoperability across different data portals. This metadata-driven approach allows automated ingestion into various data systems while maintaining data integrity and provenance.
+%%| fig-cap: "Metadata in the form of ecological metadata language (EML) is used to describe the dataset in a consistent manner that can be ingested by the portals."
+%%| file: diagrams/portals_metadata.mmd
+%%| fig-width: 6
 ```

 The EML specification provides detailed structure for describing datasets, including:

-- Dataset identification and citation
-- Geographic and temporal coverage
-- Variable definitions and units
-- Methods and protocols
-- Quality control procedures
-- Access and usage rights
+-Dataset identification and citation
+-Geographic and temporal coverage
+-Variable definitions and units
+-Methods and protocols
+-Quality control procedures
+-Access and usage rights

 This standardized metadata enables automated data transformation and ingestion into various portal systems while preserving the original data context and quality information.