[DataCap Application] Arbitr Labs - StorageBites #145

### Data Owner Name

Arbitr Labs LLC

### Data Owner Country/Region

United States

### Data Owner Industry

IT & Technology Services

### Website of the dataowner

https://sites.google.com/view/arbitr-labs/home

### Social Media Handle

Arbitr Labs

### Social Media Type

Slack

### What is your role related to the dataset

Data onramp entity that provides data onboarding services to multiple clients

### Total amount of DataCap being requested

50TiB

### Expected size of single dataset (one copy)

25TiB

### Number of replicas to store

2

### Weekly allocation of DataCap requested

5TiB
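
The figures above can be cross-checked with a small calculation. This is a minimal sketch, assuming the total request is simply the single-copy size multiplied by the replica count and that the weekly allocation is drawn down at a constant rate; the function names are hypothetical and not part of any Filecoin tooling.

```python
import math


def total_datacap_tib(single_copy_tib: float, replicas: int) -> float:
    """Total DataCap needed to store every replica of the dataset (assumed model)."""
    return single_copy_tib * replicas


def weeks_to_allocate(total_tib: float, weekly_tib: float) -> int:
    """Whole weeks needed to receive the total at a constant weekly rate."""
    return math.ceil(total_tib / weekly_tib)


# Values taken from this application: 25 TiB per copy, 2 replicas, 5 TiB per week.
total = total_datacap_tib(25, 2)
weeks = weeks_to_allocate(total, 5)
print(f"total={total} TiB, weeks={weeks}")
```

Under these assumptions the 50TiB request matches two copies of a 25TiB dataset, and the 5TiB weekly allocation would take 10 weeks to deliver in full.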

### On-chain address for first allocation

f410fzoxsvauqj7ouwtjksr7hz2oaacnhtj4puvajsky

### Data Type of Application

Encrypted, private data

### Manifest location

- [ ] Accessible through the official Filecoin dataset discovery (Currently toads.directory, must be prepared according to TOADS standards)
- [ ] Accessible through another website (give details below)
- [x] This won't be publicly accessible

### URL under which the dataset will be discoverable

_No response_

### What type of payment proof will you provide?

Filecoin txID

### What retrievability guarantees do you expect for this dataset?

Hot - data should be always available for retrievals (RPA 90%+)

### How long should a download take?

Normal - sustained 300 Mbps.

### How often will this dataset be accessed?

Weekly

### For how long do you plan to keep this dataset stored on Filecoin

1.5 to 2 years

### Share a brief history of your project and organization

```text
StorageBites is a data onramp platform by Arbitr Labs LLC that uses Filecoin as its durable storage layer.
```

### Is this project associated with other projects/ecosystem stakeholders?

No

### If answered yes, what are the other projects/ecosystem stakeholders

```text

```

### Describe the data being stored onto Filecoin

```text
Private, encrypted datasets onboarded on behalf of StorageBites clients.
```

### Where was the data currently stored in this dataset sourced from

Other

### If you answered "Other" in the previous question, enter the details here

```text
Cloudflare R2
```

### If you are a data preparer. What is your location (Country/Region)

United States

### If you are a data preparer, how will the data be prepared? Please include tooling used and technical details?

```text
Data is ingested via an S3-compatible API and converted into CAR files by the StorageBites pipeline.
```

### If you are not preparing the data, who will prepare the data?  (Provide name and business)

```text

```

### Has this dataset been stored on the Filecoin network before? If so, please explain and make the case why you would like to store this dataset again to the network. Provide details on preparation and/or SP distribution.

```text
No, the data has not been stored on the Filecoin network before.
```

### Please share a sample of the data

```text
The data is private
```

### Confirm that this is a public dataset that can be retrieved by anyone on the Network

- [ ] I confirm

### If you chose not to confirm, what was the reason

```text
These datasets are not public
```

### In which geographies do you plan on making storage deals

North America, Greater China, Europe, South America

### How will you be distributing your data to storage providers

Cloud storage (i.e. S3)

### How did you find your storage providers

Others

### If you answered "Others" in the previous question, what is the tool or platform you used

```text
SP discovery via IPNI index and custom retrieval probes
```

### Please list the provider IDs and location of the storage providers you will be working with.

```text
1. f01518369, United States
2. f03373373, China
3. f03559995, Hong Kong
4. f03619150, Germany
5. f03673681, Brazil
```

### How do you plan to make deals to your storage providers

Boost client

### If you answered "Others/custom tool" in the previous question, enter the details here

```text

```
