The idea of this site is to collect various file formats used in materials science and chemistry, describe them with metadata, and provide links to software projects that can parse them.

By providing this data in a web API, it hoped that users can discover new extractors more easily and metadata standards can be developed for the output of extractors to enable schemas to proliferate throughout the field.

This Registry accompanies the MaRDA Metadata Extractors Schemas ( repository) and API to enable programmatic use and discovery of extractors.

This project proof-of-concept project was devised by the MaRDA Metadata Extractors Working Group, and further discussions and instructions for how to get involved can be found at the discussions meta-repository.

The registry and the data within it is available under the terms of the MIT license.


Anyone can contribute file type and extractor entries to this registry by following the instructions in the contributing guidelines on GitHub.

After submitting a pull request, this data will be validated and added to the deployed database once it is merged.