A dataspace is a logical grouping of data with definitions of how that data should be loaded and processed, usually from a single source.
A combination of its data source and its name identifies it, for example, nasdaq/msft or twitter/tweets.
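The source/name identifier can be handled as a simple pair. The sketch below is illustrative only: DataspaceId and parse_dataspace_id are hypothetical helpers, not part of the runtime, and the rule that an identifier holds exactly one slash is an assumption drawn from the examples above.

```python
from typing import NamedTuple


class DataspaceId(NamedTuple):
    """Hypothetical helper: the (source, name) pair that identifies a dataspace."""

    source: str
    name: str

    def __str__(self) -> str:
        # Render the pair back into the "source/name" form used in the text.
        return f"{self.source}/{self.name}"


def parse_dataspace_id(text: str) -> DataspaceId:
    """Split a 'source/name' string into its data source and dataspace name.

    Assumption: exactly one '/' separates two non-empty parts.
    """
    source, sep, name = text.partition("/")
    if not sep or not source or not name or "/" in name:
        raise ValueError(f"expected 'source/name', got {text!r}")
    return DataspaceId(source, name)


msft = parse_dataspace_id("nasdaq/msft")
tweets = parse_dataspace_id("twitter/tweets")
```

Keeping the two parts separate makes it straightforward to group dataspaces by source or to look one up by name.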
The dataspaces node of the Spicepod manifest may hold one or more dataspace definitions. The runtime will merge data from all pod dataspaces into observations.
Each dataspace encapsulates definitions for its core data primitives, such as measurements (numerical data), categories (string-based categorical data), and tags (string-based tags). It also includes configuration for data connector and data processor components to load and process data into those primitives.
While measurements and categories are scoped to the dataspace namespace, tags from every dataspace are aggregated at the pod scope.
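The difference in scope can be sketched in a few lines of Python. This is a conceptual model, not the runtime's implementation: the merge_dataspaces function, the dictionary layout, the dotted source.name.field naming, and the field and tag names are all assumptions made for illustration.

```python
from typing import Dict, List, Set, Tuple


def merge_dataspaces(dataspaces: List[Dict]) -> Tuple[List[str], List[str], Set[str]]:
    """Model how primitives from several dataspaces combine at the pod level.

    Measurements and categories keep a per-dataspace prefix (assumed here to be
    'source.name.'), so identical field names never collide. Tags carry no
    prefix and are collected into a single pod-wide set.
    """
    measurements: List[str] = []
    categories: List[str] = []
    tags: Set[str] = set()
    for ds in dataspaces:
        prefix = f"{ds['source']}.{ds['name']}"
        measurements += [f"{prefix}.{m}" for m in ds.get("measurements", [])]
        categories += [f"{prefix}.{c}" for c in ds.get("categories", [])]
        tags.update(ds.get("tags", []))
    return measurements, categories, tags


# Hypothetical pod with two dataspaces; field and tag names are made up.
pod_dataspaces = [
    {"source": "nasdaq", "name": "msft",
     "measurements": ["price"], "tags": ["breaking", "earnings"]},
    {"source": "twitter", "name": "tweets",
     "measurements": ["price"], "categories": ["sentiment"], "tags": ["breaking"]},
]

measurements, categories, tags = merge_dataspaces(pod_dataspaces)
```

Both dataspaces declare a measurement called price, yet the two stay distinct because each is qualified by its dataspace, while the shared tag breaking appears only once in the pod-level set.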
Dataspaces may load data through a data connector and processor or through the POST /pods/{pod}/observations API. The API accepts JSON or CSV data and uses the appropriate JSON or CSV data processor.
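As a rough illustration of the API route, the following Python sketch builds (but does not send) a POST request for the observations endpoint. Several details are assumptions rather than documented behavior: the base URL http://localhost:8000, the pod name trader, the CSV column names, and the idea that the Content-Type header is what tells the runtime which processor to use. build_observations_request is a hypothetical helper.

```python
import urllib.parse
import urllib.request


def build_observations_request(
    base_url: str, pod: str, payload: bytes, content_type: str
) -> urllib.request.Request:
    """Prepare a POST to /pods/{pod}/observations without sending it.

    Assumption: the payload format (JSON or CSV) is signalled through the
    Content-Type header.
    """
    # Percent-encode the pod name so it is safe to place in the URL path.
    pod_segment = urllib.parse.quote(pod, safe="")
    url = f"{base_url.rstrip('/')}/pods/{pod_segment}/observations"
    return urllib.request.Request(
        url,
        data=payload,
        headers={"Content-Type": content_type},
        method="POST",
    )


# Hypothetical CSV payload; the column names are placeholders.
csv_body = b"time,price\n1605312000,330.5\n"
request = build_observations_request(
    "http://localhost:8000", "trader", csv_body, "text/csv"
)
```

Passing the prepared request to urllib.request.urlopen would submit it to a running runtime; check the API reference for the exact base path and accepted content types before relying on this.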
Data connectors and processors are community-maintained components for streaming and processing time-series data. More information can be found in the data-components-contrib repository.