Storage devices provide the underlying data storage environment for persisting the datasets that are processed by machine learning solutions. A storage device can exist as a distributed file system or a database.

Distributed file systems can be used for persisting immutable data that is intended for streaming access or batch processing. Databases, such as NoSQL repositories, can store both structured and unstructured data and support read/write data access.
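As a rough illustration of these two access patterns, the following Python sketch contrasts a write-once store that is read back as a stream with a key-value store that allows records to be overwritten. Both classes (`WriteOnceFileStore` and `KeyValueStore`) and the sample paths and keys are hypothetical in-memory stand-ins chosen for illustration. They are not the API of any real distributed file system or NoSQL product.

```python
from typing import Dict, Iterator, List


class WriteOnceFileStore:
    """Hypothetical stand-in for a distributed file system: files are immutable once written."""

    def __init__(self) -> None:
        self._files: Dict[str, List[str]] = {}

    def write(self, path: str, records: List[str]) -> None:
        # Refuse to overwrite: persisted data is treated as immutable.
        if path in self._files:
            raise FileExistsError(f"{path} already exists and is immutable")
        self._files[path] = list(records)

    def stream(self, path: str) -> Iterator[str]:
        # Sequential, read-only access suited to streaming or batch jobs.
        yield from self._files[path]


class KeyValueStore:
    """Hypothetical stand-in for a NoSQL database: records can be read and overwritten."""

    def __init__(self) -> None:
        self._rows: Dict[str, dict] = {}

    def put(self, key: str, value: dict) -> None:
        # Writing to an existing key replaces the previous value.
        self._rows[key] = dict(value)

    def get(self, key: str) -> dict:
        return dict(self._rows[key])


fs = WriteOnceFileStore()
fs.write("/datasets/train.csv", ["1,0.5", "2,0.7"])
batch = list(fs.stream("/datasets/train.csv"))

db = KeyValueStore()
db.put("sample-1", {"label": 0})
db.put("sample-1", {"label": 1})  # updating an existing record is allowed
```

In this sketch a second `fs.write` to the same path raises `FileExistsError`, whereas the second `db.put` simply replaces the earlier record.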











Structured data is imported into a storage device (1) using a data transfer engine (2). Unstructured data is imported (3) using another type of data transfer engine (4).
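The import flow described above can be sketched in Python, assuming a single in-memory storage device fed by two separate engines. The class names (`StorageDevice`, `StructuredTransferEngine`, `UnstructuredTransferEngine`), the method names and the sample data are all hypothetical and serve only to show the division of responsibilities. A real data transfer engine would read from external sources and write to persistent storage.

```python
import csv
import io
from typing import Dict, List


class StorageDevice:
    """Hypothetical in-memory storage device holding parsed rows and raw objects."""

    def __init__(self) -> None:
        self.tables: Dict[str, List[dict]] = {}
        self.objects: Dict[str, bytes] = {}


class StructuredTransferEngine:
    """Hypothetical engine that parses CSV text into rows before storing them."""

    def __init__(self, device: StorageDevice) -> None:
        self.device = device

    def import_csv(self, table: str, csv_text: str) -> int:
        rows = list(csv.DictReader(io.StringIO(csv_text)))
        self.device.tables.setdefault(table, []).extend(rows)
        return len(rows)  # number of rows imported


class UnstructuredTransferEngine:
    """Hypothetical engine that stores a payload as-is, without parsing it."""

    def __init__(self, device: StorageDevice) -> None:
        self.device = device

    def import_blob(self, name: str, payload: bytes) -> int:
        self.device.objects[name] = payload
        return len(payload)  # number of bytes imported


device = StorageDevice()
n_rows = StructuredTransferEngine(device).import_csv(
    "customers", "id,age\n1,34\n2,51\n"
)
n_bytes = UnstructuredTransferEngine(device).import_blob(
    "notes/call-01.txt", b"customer asked about pricing"
)
```

Keeping the two engines separate mirrors the description: the structured path depends on a schema (here, the CSV header), while the unstructured path only moves bytes.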

