Input/Output

Data scientists often work with files and other data sources. I/O is needed for reading from the data sources and writing the results back. The Java I/O API provides two main types of abstraction for this:

  • InputStream, OutputStream for binary data
  • Reader, Writer for text data

Typical data science applications deal with text rather than raw binary data--the data is often stored in TXT, CSV, JSON, and other similar text formats. This is why we will concentrate on the second part.