
Commit 2a78603

committed: add content, unfinished
1 parent a78dd15 commit 2a78603

File tree: 3 files changed, +168 −3 lines

sphinx/software_desc/glossary.rst

Lines changed: 73 additions & 0 deletions

Glossary
#############

Specify-defined terms


.. glossary::

   Algorithm
      An algorithm is a procedure or formula for solving a problem. There are
      multiple algorithms for computing Species Distribution Models (SDMs), which
      define the relationship between a set of points and the environmental
      values at those points.

   Container
      A :term:`Docker` instance which runs as an application on a
      :term:`Host machine`. The Docker container contains all software
      dependencies required by the programs it will run.

   CSV
      CSV (Comma-Separated Values) is a file format for records in which fields
      are separated by a delimiter. Commas and tabs are common, but other
      characters may be used as delimiters.
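
      As a minimal illustration, the standard Python ``csv`` module can read a
      delimited file; the file name and column names below are hypothetical:

      .. code-block:: python

         import csv

         # DictReader keys each row by the header line, so fields can be
         # addressed by column name rather than position.
         with open("occurrences.csv", newline="", encoding="utf-8") as f:
             for row in csv.DictReader(f, delimiter=","):
                 print(row["catalogNumber"], row["locality"])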

   Data Catalog
      The Specify database of tables and fields containing all data related to
      one or more Collections.

   Data Validation
      Testing processes run on data values to ensure that they meet the
      conditions defined for the field, table, and database. These can include
      checks on data type, formatting, and content restrictions such as a
      controlled vocabulary, a numeric range, or the existence of a database
      identifier.
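
      A minimal sketch of such checks in Python, assuming a hypothetical
      controlled vocabulary and field set:

      .. code-block:: python

         # Hypothetical controlled vocabulary for a "preparationType" field.
         PREP_TYPES = {"skin", "skeleton", "ethanol"}

         def validate(record: dict) -> list[str]:
             """Return a list of validation errors for one record."""
             errors = []
             if record.get("preparationType") not in PREP_TYPES:
                 errors.append("preparationType not in controlled vocabulary")
             try:
                 if not 1700 <= int(record["year"]) <= 2100:
                     errors.append("year out of range")
             except (KeyError, ValueError):
                 errors.append("year missing or not numeric")
             return errors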

   Docker
      Docker is an application which can run on Linux, macOS, or Windows. With a
      Docker-ized application, such as this tutorial, a user can run the
      application on their local machine in a controlled and sequestered
      environment, with a set of dependencies that may not be easy to install,
      permitted, or even available on their local machine.

   Docker image
      A Docker-ized application, built into a single package with all required
      software dependencies and files.

   DwCA
      DwCA (Darwin Core Archive) is a packaged dataset of occurrence records in
      `Darwin Core standard <https://www.tdwg.org/standards/dwc/>`_ format,
      along with metadata about the contents.
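
      A Darwin Core Archive is distributed as a ZIP file whose ``meta.xml``
      descriptor maps the data files' columns to Darwin Core terms. A minimal
      sketch of inspecting one with the Python standard library (the archive
      name is hypothetical):

      .. code-block:: python

         import zipfile

         with zipfile.ZipFile("export.dwca.zip") as archive:
             # Typical members: meta.xml, eml.xml, and a data file such as
             # occurrence.txt.
             print(archive.namelist())
             print(archive.read("meta.xml").decode("utf-8")[:300])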

   Host machine
      A physical or virtual machine on which Docker can be run.

   Mapped Spreadsheet
      A spreadsheet that has a mapping document matching its columns to Specify
      database fields.

   Mapping Template
      A document that matches terms in a :term:`Mapped Spreadsheet` to fields in
      the Specify database.

   Occurrence
      A record of a specimen occurrence, including metadata about the specimen
      and the spatial location where it was found.

   Occurrence Data
      Point data representing specimens collected for a single species or taxon.
      Each data point contains a location, x and y, in some known geographic
      spatial reference system.

   Tree
      A Tree is a set of hierarchical data. Several tables in Specify are
      defined as trees: Taxonomy, Geography, and Storage Location.

sphinx/software_desc/migration.rst

Lines changed: 20 additions & 3 deletions
…via Specify’s Workbench tool, Specify’s API, or with SQL scripts directly in the
backend Specify database (MariaDB). SCC staff would work with the Migration team to
choose specific processing options and methods for data migrations.
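
As an illustration of the direct-SQL option, a one-off cleanup might be run from a
short Python script; the connection details, table, and column names below are
assumptions for illustration only, and any change to the backend database should be
planned with SCC staff:

.. code-block:: python

   import pymysql

   # Connect to the backend MariaDB database (hypothetical credentials).
   connection = pymysql.connect(host="localhost", user="specify",
                                password="secret", database="specify")
   try:
       with connection.cursor() as cursor:
           # Example cleanup: strip stray whitespace from catalog numbers.
           cursor.execute(
               "UPDATE collectionobject SET CatalogNumber = TRIM(CatalogNumber)")
       connection.commit()
   finally:
       connection.close()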

SCC technical staff will train a Point Person and Migration team involved in database
setup and data migration to understand Specify at the Support level. These individuals
should plan to allocate a week to visit SCC headquarters in Kansas. They will work
one-on-one with SCC technical support staff and software engineers to attain a
database administrator level of mastery. After the visit, SCC staff will continue to
meet with the person or team as needed over Zoom to discuss questions and to research
and resolve issues that arise.

Members are responsible for their staff travel expenses; the SCC will allocate staff
time and project resources at no cost. SCC can facilitate meetings with other large
national organizations that have undergone similar processes of assessing collections’
requirements, deciding on configuration and customization options, preparing data for
migration, and importing data into Specify.

SCC has worked with several large-scale organizations in transitions to Specify,
including the Danish Natural History Museums, the Canadian Laurentian Forestry Centre,
and the Australian federal government’s CSIRO. Each member has taken a custom
transition to Specify based on technical expertise and desired outcomes.

sphinx/software_desc/workflows.rst

Lines changed: 75 additions & 0 deletions

Common Workflows
##################

Field collection of specimens
******************************************

Specify contains a table for Permits, which can be configured at the Institution
level to be associated with an **Accession**, **Collecting Event**, or
**Collecting Trip** (the "Permit Associated Record"), allowing the user to document
permits acquired for collecting and cataloging specimens. Either the Permit or the
Permit Associated Record can be created first, and either can be linked to an
existing record.

SCC recommends sending researchers into the field with a spreadsheet that has a
"Mapping Template" matching spreadsheet columns to Specify database fields. The
spreadsheet for data entry is referred to as a "Mapped Spreadsheet". Mapped
Spreadsheets can be created for research expeditions, focusing on data collection
specific to that field trip. Researchers can easily use the spreadsheet in the field
to record information about specimens, collecting event(s), locality, and more.

Data entered into a Mapped Spreadsheet can be imported via the Specify Workbench, a
spreadsheet-based application. In this workflow, the user chooses the correct Mapping
Template, then uploads the Mapped Spreadsheet to the Workbench. At this stage, the
Workbench is completely external to the data catalog. The Workbench contains
extensive matching and editing features that can be configured to fit user needs.
The Workbench then performs Data Validation on the spreadsheet contents before
submitting the data for upload to the Data Catalog.

The Workbench allows the user to bring in bulk data, match columns to fields in
Specify, and perform basic data integrity checks to ensure the data matches database
requirements (data validity, controlled vocabulary matching, and linked record
matching, such as Agent or Taxon records); a sketch of the column-matching step
follows below.
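
A minimal sketch of that column matching in Python; the template, file name, and
field names are hypothetical stand-ins, not Specify's actual mapping format:

.. code-block:: python

   import csv

   # Hypothetical mapping template: spreadsheet column -> Specify field.
   MAPPING = {
       "Field No.": "fieldNumber",
       "Collector": "collectors",
       "Latitude": "latitude1",
       "Longitude": "longitude1",
   }

   with open("expedition.csv", newline="", encoding="utf-8") as f:
       reader = csv.DictReader(f)
       # Report spreadsheet columns with no database mapping, and mapped
       # columns absent from the spreadsheet, before any upload.
       print("unmapped:", sorted(set(reader.fieldnames) - set(MAPPING)))
       print("missing:", sorted(set(MAPPING) - set(reader.fieldnames)))
       # Rename the mapped columns to their database field names.
       rows = [{MAPPING[c]: v for c, v in row.items() if c in MAPPING}
               for row in reader]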

Alternatively, a researcher could create a custom spreadsheet and simply create a
mapping template before importing data to Specify.

Users first validate the spreadsheet data within the Workbench, then upload the
verified data to the database. Users may be assigned different levels of access to
Workbench functionality, such as permission to validate a dataset or to perform the
upload, so that different people may verify that the data is sound.

Because Specify 7 is online software, where there is internet access a dataset can
be created directly in the Specify Workbench from the field. This workflow allows
researchers to verify data against database requirements on entry.

More information on our Workbench application is available here:
https://discourse.specifysoftware.org/t/the-specify-7-workbench/540

Object entry
******************************************

The user has three ways to document information pre-cataloging: 1) record the data
in the Specify Workbench, 2) enter it into the database and mark it as distinct from
cataloged data, or 3) put the data into a separate collection and bring it back over
once it is ready to be cataloged.

If the data is in the Workbench, it is segregated from queries and exports; search
boxes that match against existing records (such as Agent or Taxon records) do not
find this information. However, that also means it cannot be linked to Loans or any
other Interactions. A user can have multiple active, not-yet-uploaded datasets, but
a dataset is uploaded as a single file, meaning all items in a dataset are uploaded
at the same time. If only a portion of the items in a dataset has been cataloged,
you must either wait for the rest to be cataloged before uploading or move the
uncataloged records to another dataset.

If the data is added to the database but marked as separate, it would be included in
any search box searches and could be included in Loans or other Interactions. The
data could be excluded from queries or data exports by filtering on the field used
to mark it as separate. The user could use a checkbox field to indicate whether an
item is cataloged, or use the lack of a catalog number as an indicator.

If the data is added to a separate collection in the same database, it would be
completely separate from the cataloged data but could still be treated as a
collection object, meaning all database and curatorial treatments could be
documented. A script, ideally using the Specify API, could be set to run
automatically each night to bring records over from the separate collection to the
cataloged collection once they meet a requirement, such as having a catalog number
added, indicating they have been cataloged. A sketch of such a job appears below.
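
A minimal sketch of that nightly transfer, assuming a REST-style interface; the base
URL, endpoint paths, authentication, and field names are illustrative assumptions,
not the documented Specify API:

.. code-block:: python

   import requests

   BASE = "https://specify.example.org/api"  # hypothetical server
   session = requests.Session()
   session.headers["Authorization"] = "Bearer <token>"  # hypothetical auth

   # 1. Find staging records that now have a catalog number assigned.
   resp = session.get(f"{BASE}/staging/collectionobject/",
                      params={"has_catalog_number": "true"})
   resp.raise_for_status()

   for record in resp.json()["records"]:
       # 2. Create the record in the cataloged collection...
       session.post(f"{BASE}/main/collectionobject/",
                    json=record).raise_for_status()
       # 3. ...then remove it from the staging collection.
       session.delete(
           f"{BASE}/staging/collectionobject/{record['id']}/").raise_for_status()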

Each of these approaches to pre-cataloged data has been used successfully by
existing SCC members.

Acquisition and accessioning
******************************************

Location and movement control
******************************************

Cataloguing
******************************************

Loans in (borrowing objects)
******************************************

Loans out (lending objects)
******************************************

Use of collections
******************************************

Condition checking and improvement
******************************************

Deaccessioning and disposal
******************************************
