Culture Knowledge Graph Kitchen |
The Culture Knowledge Graph Kitchen is a versatile ETL (Extract-Transform-Load) environment designed for efficient consumption, processing, integration, and analysis of data feeds into the Culture Knowledge Graph. It comprises several Python components, each serving a unique purpose within the "kitchen" to consume, clean, version control, publish and analyze data. |
Culture Graph Interchange Format |
The Culture Graph Interchange Format (CGIF) is an easy-to-use, lightweight interchange format based on schema.org for the harvesting of data. CGIF has the added benefit of automatically making the data eligible for Google Dataset Search and to significantly improve the findability of websites and datasets through search engine optimization. |
Data feed |
Following schema.org, a data feed provides structured information about one or more entities or topics (see https://schema.org/DataFeed). A data feed may also be known as a dataset. |
Data feed element |
Following schema.org, a data feed element is an item within in a data feed. Several elements can be contained in a data feed (see https://schema.org/dataFeedElement). |
Data portal |
Following the definition of the NFDIcore Ontology, a data portal is a website that serves as a centralised platform for accessing, managing, and sharing datasets, information, or resources related to a specific topic, theme, or domain (see https://nfdi.fiz-karlsruhe.de/ontology/NFDI_0000123). |
Hydra Scraper |
The Hydra Scraper is a command-line tool to pull data from various sources, such as Hydra-paginated APIs, Beacon-like URL list, ZIP files, or local file dumps. It can transform files in RDF-compatible formats such as JSON-LD or Turtle, but it can also handle, for example, LIDO files. Command-line calls can be combined and adapted to build fully-fledged scraping mechanisms, including the ability to output a set of triples, which is used to harvest data for the Culture Knowledge Graph. |
NFDIcore Ontology |
The NFDIcore ontology (prefix nfdicore ) describes resources such as datasets, data providers, persons, projects and other entities in the data domain of NFDI. It serves as the basis for further domain-specific ontology modules. Mappings to numerous external vocabularies and ontologies are provided in an extra file. |
NFDI4Culture Ontology |
The NFDI4Culture ontology module (prefix cto ) handles the representation and categorization of various resources within the domains of NFDI4Culture. It encompasses a wide array of entities including individual source items and properties for e.g. related persons, organizations, locations, data concepts, calssifiers and time information. CTO builds upon the NFDIcore Ontology. |