The standard library of many programming languages includes a "scanner API" to extract strings, numbers, or other objects from text input streams. (For example, Java includes the Scanner
class, C++ includes istream
, and C includes scanf
).
What is the equivalent of this in Python?
Python has a stream interface, i.e. classes that inherit from io.IOBase
. However, the Python TextIOBase
stream interface only provides facilities for line-oriented input. After reading the documentation and searching on Google, I can't find something in the standard Python modules that would let me, for example, extract an integer from a text stream, or extract the next space-delimited word as a string. Are there any standard facilities to do this?
There is no direct equivalent (as far as I know). However, you can do pretty much the same thing with regular expressions (see the
re
module).For instance:
It requires a little more work than the usual C-style scanner interface, but it is also very flexible and powerful. You will have to process stream I/O yourself though.
There is no equivalent of
fscanf
or Java'sScanner
. The simplest solution is to require the user to use newline separeted input instead of space separated input, you can then read line by line and convert the lines to the correct type.If you want the user to provide more structured input then you probably should create a parser for the user input. There are some nice parsing libraries for python, for example pyparsing. There is also a
scanf
module, even though the last update is of 2008.If you don't want to have external dependencies then you can use regexes to match the input sequences. Certainly regexes require to work on strings, but you can easily overcome this limitation reading in chunks. For example something like this should work well most of the time:
Example usage:
You'll probably have to extend this, and test it properly but it should give you some ideas.