DataIterator API#
DataIterator#
- class ray.data.DataIterator[source]#
- An iterator for reading records from a - Dataset.- For Datasets, each iteration call represents a complete read of all items in the Dataset. - If using Ray Train, each trainer actor should get its own iterator by calling - ray.train.get_dataset_shard("train").- Examples - >>> import ray >>> ds = ray.data.range(5) >>> ds Dataset(num_rows=5, schema={id: int64}) >>> ds.iterator() DataIterator(Dataset(num_rows=5, schema={id: int64})) 
| Return a batched iterable over the dataset. | |
| Return a local row iterable over the dataset. | |
| Return a batched iterable of Torch Tensors over the dataset. | |
| Execute and materialize this data iterator into object store memory. | |
| Returns a string containing execution timing information. | |
| Return a TF Dataset over this dataset. |