Skip to content

Internet Archive

Create a YAML file with the following fields:

FieldDescription
identifierUnique item identifier
access_keyS3-like access key
secret_keyS3-like secret key
file_pathsList of files to upload
metadataItem metadata
FieldDescription
titleItem title
descriptionItem description
creatorCreator name
mediatypeType: data, texts, audio, video, etc.
collectionCollection name (e.g., opensource)
subjectTags separated by semicolons
datePublication date
languageLanguage code (e.g., eng)

Example:

identifier: my-dataset-2025
access_key: <YOUR_ACCESS_KEY>
secret_key: <YOUR_SECRET_KEY>
file_paths:
- /path/to/dataset.tar.gz
- /path/to/documentation.pdf
metadata:
title: My research dataset
description: Dataset containing research data
creator: Author Name
mediatype: data
collection: opensource
subject: research;data;science

See examples/internet_archive_upload.yaml for a complete example.

Terminal window
python -m piccione.upload.on_internet_archive config.yaml
  • MD5 verification
  • Automatic retry (3 retries with 10s delay)
  • Progress bar