OpenCitations Meta#
Quick start#
Install:
pip install oc_meta
Run the main processing pipeline:
python -m oc_meta.run.meta_process -c meta_config.yaml
Input format#
Meta expects CSV files with these columns:
Column |
Description |
|---|---|
|
Space-separated identifiers ( |
|
Title of the work |
|
Semicolon-separated names with optional identifiers ( |
|
ISO 8601 date ( |
|
Container title with optional identifier ( |
|
Volume number |
|
Issue number |
|
Page range ( |
|
Resource type ( |
|
Publisher name with optional identifier ( |
|
Same format as |
See the CSV format reference for the complete specification.
Documentation#
Configuration — YAML config file setup
Preprocessing — Filter and prepare input data
Processing — Run the main pipeline
Verification — Validate output
Editing entities — Modify existing RDF
Merge — Detect and merge duplicates
Info dir — Filesystem counter management
Benchmark — Performance measurement
Testing — Test infrastructure and fixtures