Hugging Face Video

This repository contains a simple command-line tool for uploading camera trap video data as a dataset to the Hugging Face (HF) platform. HF does not currently support video datasets in their container format (e.g., MP4). The tool converts each video into bytes and stores it in a column-oriented format (i.e., Parquet), which enables many of HF's attractive features (e.g., dataset streaming) for video. The repository also includes a Python package with IO-optimised transformations that extract frames (as torch.Tensor objects) from the bytes streamed as a dataset from the Hub.
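To make the bytes-in-Parquet idea concrete, here is a minimal sketch. It is not the repository's upload code: the helper names (collect_video_records, write_parquet) and the list of file extensions are assumptions made for illustration, and only the "video" and "filename" column names come from the usage example further down. Writing the Parquet file assumes pyarrow is installed.

```python
from pathlib import Path

# Assumed set of video extensions; adjust to match your footage.
VIDEO_SUFFIXES = {".mp4", ".avi", ".mov"}


def collect_video_records(root):
    """Read every video file under `root` into a {"video", "filename"} record."""
    records = []
    for path in sorted(Path(root).rglob("*")):
        if path.is_file() and path.suffix.lower() in VIDEO_SUFFIXES:
            # The whole container file is kept as one opaque bytes value.
            records.append({"video": path.read_bytes(), "filename": path.name})
    return records


def write_parquet(records, out_path):
    """Store the records column-wise in a Parquet file (hypothetical helper)."""
    # Imported lazily so collect_video_records stays standard-library only.
    import pyarrow as pa
    import pyarrow.parquet as pq

    table = pa.Table.from_pylist(records)  # one binary column, one string column
    pq.write_table(table, out_path)
```

Because every row holds a complete video file, rows can be read back one at a time, which is what makes streaming practical for video.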

Installation

Install Anaconda, then create a conda environment from the environment file (conda-environment.yml) and install the Python dependencies with the following commands.

conda env create --name envname --file conda-environment.yml
conda activate envname
pip install -r requirements.txt

Usage

The examples below show how to use the command-line tool and the transformations package. The tool was developed primarily for inference over video footage, so it does not yet support uploading the corresponding labels (where they exist).

Dataset Uploader

python upload_data.py --path_to_data=videos/ --repo_id=username/dataset # see cache dir for memory issues...

This creates a dataset that can be retrieved with the Hugging Face datasets library:

from datasets import load_dataset

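# split='train' is an assumption; use the split name shown on the dataset's Hub page.
# Without a split, load_dataset returns a dict of splits rather than an iterable dataset.
dataset = load_dataset('username/dataset', split='train', streaming=True)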
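A streamed dataset is iterated rather than indexed, so a quick way to check an upload is to look at the first few records. The sketch below uses two hypothetical helpers (peek and describe) that are not part of hf_video or the datasets library. It assumes each record is a dict with the "video" and "filename" keys shown in the transformation example.

```python
from itertools import islice


def peek(stream, n=2):
    """Return the first `n` records of an iterable, reading no further than needed."""
    return list(islice(stream, n))


def describe(record):
    """Summarise one {"video": bytes, "filename": str} record."""
    return f'{record["filename"]}: {len(record["video"])} bytes'


# Usage, where `split` is one split of the streamed dataset:
# for record in peek(split):
#     print(describe(record))
```

Only the requested records are pulled from the Hub, so this check stays cheap even for a large dataset.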

HF Video Transformation

Now you can use the hf_video package to retrieve videos as PyTorch tensors:

from hf_video import PanAfTransform
from torch.utils.data import DataLoader

# Initialise transform
my_transform = PanAfTransform(
    num_frames=32,
    image_size=224,
    ...
)

# Initialise loader
loader = DataLoader(dataset)

# Transform batched data
for batch in loader: # batches of {"video": bytes, "filename": name}
    t_batch = my_transform(batch) # batches of {"video": torch.Tensor, "filename": name}
    output = model(t_batch)
    ...
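For readers curious what a bytes-to-tensor step can look like, here is a rough sketch. It is not the implementation of PanAfTransform: the function names are hypothetical, the even frame sampling is an assumed strategy, and decoding relies on PyAV (av), NumPy, and PyTorch being installed. It does no resizing, so it has no counterpart to image_size.

```python
import io


def uniform_frame_indices(total_frames, num_frames):
    """Pick `num_frames` evenly spaced indices from a clip of `total_frames`."""
    if total_frames <= 0 or num_frames <= 0:
        raise ValueError("total_frames and num_frames must be positive")
    # Integer arithmetic keeps every index inside [0, total_frames - 1];
    # short clips simply repeat frames.
    return [(i * total_frames) // num_frames for i in range(num_frames)]


def decode_video_bytes(data, num_frames=32):
    """Decode raw container bytes into a (T, C, H, W) uint8 tensor (hypothetical helper)."""
    # Imported lazily so the index helper above has no third-party dependencies.
    import av
    import numpy as np
    import torch

    # PyAV can open an in-memory file-like object, so nothing is written to disk.
    with av.open(io.BytesIO(data)) as container:
        frames = [f.to_ndarray(format="rgb24") for f in container.decode(video=0)]
    if not frames:
        raise ValueError("no video frames could be decoded")

    indices = uniform_frame_indices(len(frames), num_frames)
    clip = np.stack([frames[i] for i in indices])  # (T, H, W, C)
    return torch.from_numpy(clip).permute(0, 3, 1, 2)  # (T, C, H, W)
```

This version decodes every frame before sampling, which is simple but memory-hungry for long videos. An IO-optimised transform would more likely seek to or decode only the frames it needs.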

wild-chimpanzee-foundation/hf_video
