SEARS SDK

The purpose of this SDK is to publish code that will help data scientists to query MongoDB using python so as to bulk download data and files directly from the SEARS backend for aggregated analysis. Case studies 6.1 and 6.2 from our main paper were conducted using this SDK.

Main SEARS platform

Please refer to our main SEARS platform repository here.

Steps to pull data.

Copy the .env file to the root directory of the project. Update the connection string to use your own MongoDB Atlas connection string. Also update the AWS S3 parameters as per your AWS settings.
Install all requirements using pip3 install -r requirements.txt
Run python3 mongo_connect.py to download data from MongoDB to a CSV file. Set search_criteria and output_file_name in the program file.
Run python3 AWS_Download.py to download files from AWS S3 to a local directory ./file_fetch/. All files related to experiments meeting the search criteria will be downloaded.
Run your ML model on the downloaded data and files.

Process to automate the upload of experiment data to MongoDB

#Steps

Notice the folder ./uploads in the root directory of the project. This folder is used to upload data to MongoDB.
Drop data for an experiment in the folder ./uploads. The data should be in the form of a JSON file.
Run the program python3 auto_upload.py to upload the data to MongoDB. The program will automatically upload the data to the MongoDB collection productData.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
__pycache__		__pycache__
file_fetch		file_fetch
logs		logs
.env		.env
.gitignore		.gitignore
AWS_Download.py		AWS_Download.py
LICENSE		LICENSE
ReadMe.md		ReadMe.md
auto_upload.py		auto_upload.py
logging_helper.py		logging_helper.py
mongo_connect.py		mongo_connect.py
playground-1.mongodb.js		playground-1.mongodb.js
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SEARS SDK

Main SEARS platform

Steps to pull data.

Process to automate the upload of experiment data to MongoDB

About

Releases

Packages

Languages

License

baskargroup/SEARS-Data-Pull

Folders and files

Latest commit

History

Repository files navigation

SEARS SDK

Main SEARS platform

Steps to pull data.

Process to automate the upload of experiment data to MongoDB

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages