contract.csv¶
CsvParams
dataclass
¶
CsvParams(
path: PathLike,
delimiter: Optional[str] = None,
strict_header: bool = True,
index: Optional[str] = None,
)
Bases: FilePathBaseParam
Parameters for CSV file processing.
Extends the base file path parameters with CSV-specific options.
Attributes:
Name | Type | Description |
---|---|---|
delimiter |
Optional[str]
|
Custom delimiter character for CSV parsing. If None, the default comma delimiter is used. |
strict_header |
bool
|
If True, treats the first row as a header. Otherwise, no header is assumed. |
index |
Optional[str]
|
Column name to set as the DataFrame index. If None, default numeric indices are used. |
Csv ¶
Csv(
name: str,
*,
description: Optional[str] = None,
reader_params: Optional[TReaderParams] = None,
**kwargs,
)
Bases: DataStream[DataFrame, CsvParams]
CSV file data stream provider.
A data stream implementation for reading CSV files into pandas DataFrames with configurable parameters for delimiter, header handling, and indexing.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
DataStream
|
Base class for data stream providers. |
required |
Examples:
from contraqctor.contract.csv import Csv, CsvParams
# Create and load a CSV stream
params = CsvParams(path="data/european_data.csv", delimiter=";")
csv_stream = Csv("measurements", reader_params=params)
csv_stream.load()
# Access the DataFrame
df = csv_stream.data
filtered = df[df["temperature"] > 25]
Source code in src/contraqctor/contract/base.py
33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 |
|
name
property
¶
name: str
Get the name of the data stream.
Returns:
Name | Type | Description |
---|---|---|
str |
str
|
Name identifier of the data stream. |
resolved_name
property
¶
resolved_name: str
Get the full hierarchical name of the data stream.
Generates a path-like name showing the stream's position in the hierarchy, using '::' as a separator between parent and child names.
Returns:
Name | Type | Description |
---|---|---|
str |
str
|
The fully resolved name including all parent names. |
description
property
¶
parent
property
¶
parent: Optional[DataStream]
Get the parent data stream.
Returns:
Type | Description |
---|---|
Optional[DataStream]
|
Optional[DataStream]: Parent data stream, or None if this is a root stream. |
is_collection
property
¶
is_collection: bool
Check if this data stream is a collection of other streams.
Returns:
Name | Type | Description |
---|---|---|
bool |
bool
|
True if this is a collection stream, False otherwise. |
reader_params
property
¶
reader_params: TReaderParams
Get the parameters for the data reader.
Returns:
Name | Type | Description |
---|---|---|
TReaderParams |
TReaderParams
|
Parameters for the data reader. |
has_data
property
¶
has_data: bool
Check if the data stream has loaded data.
Returns:
Name | Type | Description |
---|---|---|
bool |
bool
|
True if data has been loaded, False otherwise. |
data
property
¶
data: TData
Get the loaded data.
Returns:
Name | Type | Description |
---|---|---|
TData |
TData
|
The loaded data. |
Raises:
Type | Description |
---|---|
ValueError
|
If data has not been loaded yet. |
read ¶
read(
reader_params: Optional[TReaderParams] = None,
) -> TData
Read data using the configured reader.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
reader_params
|
Optional[TReaderParams]
|
Optional parameters to override the default reader parameters. |
None
|
Returns:
Name | Type | Description |
---|---|---|
TData |
TData
|
Data read from the source. |
Raises:
Type | Description |
---|---|
ValueError
|
If reader parameters are not set. |
Source code in src/contraqctor/contract/base.py
116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 |
|
bind_reader_params ¶
bind_reader_params(params: TReaderParams) -> Self
Bind reader parameters to the data stream.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
params
|
TReaderParams
|
Parameters to bind to the data stream's reader. |
required |
Returns:
Name | Type | Description |
---|---|---|
Self |
Self
|
The data stream instance for method chaining. |
Raises:
Type | Description |
---|---|
ValueError
|
If reader parameters have already been set. |
Source code in src/contraqctor/contract/base.py
133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 |
|
at ¶
at(name: str) -> DataStream
Get a child data stream by name.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
name
|
str
|
Name of the child data stream to retrieve. |
required |
Returns:
Name | Type | Description |
---|---|---|
DataStream |
DataStream
|
The child data stream with the given name. |
Raises:
Type | Description |
---|---|
NotImplementedError
|
If the data stream does not support child access. |
Examples:
# Access stream in a collection
collection = data_collection.load()
temp_stream = collection.at("temperature")
# Or using dictionary-style syntax
humidity_stream = collection["humidity"]
Source code in src/contraqctor/contract/base.py
150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 |
|
load ¶
load() -> Self
Load data into the data stream.
Reads data from the source and stores it in the data stream.
Returns:
Name | Type | Description |
---|---|---|
Self |
Self
|
The data stream instance for method chaining. |
Examples:
from contraqctor.contract import csv
# Create and load a CSV stream
params = csv.CsvParams(path="data/measurements.csv")
csv_stream = csv.Csv("measurements", reader_params=params)
csv_stream.load()
# Access the data
df = csv_stream.data
print(f"Loaded {len(df)} rows")
Source code in src/contraqctor/contract/base.py
208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 |
|
load_all ¶
load_all(
strict: bool = False,
) -> list[tuple[DataStream, Exception], None, None]
Recursively load this data stream and all child streams.
Performs depth-first traversal to load all streams in the hierarchy.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
strict
|
bool
|
If True, raises exceptions immediately; otherwise collects and returns them. |
False
|
Returns:
Name | Type | Description |
---|---|---|
list |
list[tuple[DataStream, Exception], None, None]
|
List of tuples containing streams and exceptions that occurred during loading. |
Raises:
Type | Description |
---|---|
Exception
|
If strict is True and an exception occurs during loading. |
Examples:
# Load all streams and handle errors
errors = collection.load_all(strict=False)
if errors:
for stream, error in errors:
print(f"Error loading {stream.name}: {error}")
Source code in src/contraqctor/contract/base.py
259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 |
|