Tutorial 71 80
Tutorial 71 80
Other types of objects need to be converted – either to a string (in text mode) or a bytes object (in binary mode) – before
writing them:
f.tell() returns an integer giving the file object’s current position in the file represented as number of bytes from the
beginning of the file when in binary mode and an opaque number when in text mode.
To change the file object’s position, use f.seek(offset, whence). The position is computed from adding offset
to a reference point; the reference point is selected by the whence argument. A whence value of 0 measures from the
beginning of the file, 1 uses the current file position, and 2 uses the end of the file as the reference point. whence can be
omitted and defaults to 0, using the beginning of the file as the reference point.
In text files (those opened without a b in the mode string), only seeks relative to the beginning of the file are allowed (the
exception being seeking to the very file end with seek(0, 2)) and the only valid offset values are those returned from
the f.tell(), or zero. Any other offset value produces undefined behaviour.
File objects have some additional methods, such as isatty() and truncate() which are less frequently used; consult
the Library Reference for a complete guide to file objects.
Strings can easily be written to and read from a file. Numbers take a bit more effort, since the read() method only
returns strings, which will have to be passed to a function like int(), which takes a string like '123' and returns
its numeric value 123. When you want to save more complex data types like nested lists and dictionaries, parsing and
serializing by hand becomes complicated.
Rather than having users constantly writing and debugging code to save complicated data types to files, Python allows
you to use the popular data interchange format called JSON (JavaScript Object Notation). The standard module called
json can take Python data hierarchies, and convert them to string representations; this process is called serializing.
Reconstructing the data from the string representation is called deserializing. Between serializing and deserializing, the
string representing the object may have been stored in a file or data, or sent over a network connection to some distant
machine.
Note: The JSON format is commonly used by modern applications to allow for data exchange. Many programmers are
already familiar with it, which makes it a good choice for interoperability.
If you have an object x, you can view its JSON string representation with a simple line of code:
Another variant of the dumps() function, called dump(), simply serializes the object to a text file. So if f is a text file
object opened for writing, we can do this:
json.dump(x, f)
To decode the object again, if f is a binary file or text file object which has been opened for reading:
x = json.load(f)
Note: JSON files must be encoded in UTF-8. Use encoding="utf-8" when opening JSON file as a text file for
both of reading and writing.
This simple serialization technique can handle lists and dictionaries, but serializing arbitrary class instances in JSON
requires a bit of extra effort. The reference for the json module contains an explanation of this.
See also:
pickle - the pickle module
Contrary to JSON, pickle is a protocol which allows the serialization of arbitrarily complex Python objects. As such, it is
specific to Python and cannot be used to communicate with applications written in other languages. It is also insecure by
default: deserializing pickle data coming from an untrusted source can execute arbitrary code, if the data was crafted by
a skilled attacker.
EIGHT
Until now error messages haven’t been more than mentioned, but if you have tried out the examples you have probably
seen some. There are (at least) two distinguishable kinds of errors: syntax errors and exceptions.
Syntax errors, also known as parsing errors, are perhaps the most common kind of complaint you get while you are still
learning Python:
The parser repeats the offending line and displays a little ‘arrow’ pointing at the earliest point in the line where the error
was detected. The error is caused by (or at least detected at) the token preceding the arrow: in the example, the error is
detected at the function print(), since a colon (':') is missing before it. File name and line number are printed so
you know where to look in case the input came from a script.
8.2 Exceptions
Even if a statement or expression is syntactically correct, it may cause an error when an attempt is made to execute it.
Errors detected during execution are called exceptions and are not unconditionally fatal: you will soon learn how to handle
them in Python programs. Most exceptions are not handled by programs, however, and result in error messages as shown
here:
>>> 10 * (1/0)
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
ZeroDivisionError: division by zero
>>> 4 + spam*3
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
NameError: name 'spam' is not defined
>>> '2' + 2
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
TypeError: can only concatenate str (not "int") to str
67
Python Tutorial, Release 3.12.2
The last line of the error message indicates what happened. Exceptions come in different types, and the type is printed as
part of the message: the types in the example are ZeroDivisionError, NameError and TypeError. The string
printed as the exception type is the name of the built-in exception that occurred. This is true for all built-in exceptions,
but need not be true for user-defined exceptions (although it is a useful convention). Standard exception names are built-in
identifiers (not reserved keywords).
The rest of the line provides detail based on the type of exception and what caused it.
The preceding part of the error message shows the context where the exception occurred, in the form of a stack traceback.
In general it contains a stack traceback listing source lines; however, it will not display lines read from standard input.
bltin-exceptions lists the built-in exceptions and their meanings.
It is possible to write programs that handle selected exceptions. Look at the following example, which asks the user for
input until a valid integer has been entered, but allows the user to interrupt the program (using Control-C or whatever
the operating system supports); note that a user-generated interruption is signalled by raising the KeyboardInterrupt
exception.
A class in an except clause is compatible with an exception if it is the same class or a base class thereof (but not the
other way around — an except clause listing a derived class is not compatible with a base class). For example, the following
code will print B, C, D in that order:
class B(Exception):
pass
class C(B):
pass
(continues on next page)
class D(C):
pass
Note that if the except clauses were reversed (with except B first), it would have printed B, B, B — the first matching
except clause is triggered.
When an exception occurs, it may have associated values, also known as the exception’s arguments. The presence and
types of the arguments depend on the exception type.
The except clause may specify a variable after the exception name. The variable is bound to the exception instance which
typically has an args attribute that stores the arguments. For convenience, builtin exception types define __str__()
to print all the arguments without explicitly accessing .args.
>>> try:
... raise Exception('spam', 'eggs')
... except Exception as inst:
... print(type(inst)) # the exception type
... print(inst.args) # arguments stored in .args
... print(inst) # __str__ allows args to be printed directly,
... # but may be overridden in exception subclasses
... x, y = inst.args # unpack args
... print('x =', x)
... print('y =', y)
...
<class 'Exception'>
('spam', 'eggs')
('spam', 'eggs')
x = spam
y = eggs
The exception’s __str__() output is printed as the last part (‘detail’) of the message for unhandled exceptions.
BaseException is the common base class of all exceptions. One of its subclasses, Exception, is the base class of
all the non-fatal exceptions. Exceptions which are not subclasses of Exception are not typically handled, because they
are used to indicate that the program should terminate. They include SystemExit which is raised by sys.exit()
and KeyboardInterrupt which is raised when a user wishes to interrupt the program.
Exception can be used as a wildcard that catches (almost) everything. However, it is good practice to be as specific as
possible with the types of exceptions that we intend to handle, and to allow any unexpected exceptions to propagate on.
The most common pattern for handling Exception is to print or log the exception and then re-raise it (allowing a caller
to handle the exception as well):
import sys
try:
f = open('myfile.txt')
(continues on next page)
The try … except statement has an optional else clause, which, when present, must follow all except clauses. It is
useful for code that must be executed if the try clause does not raise an exception. For example:
for arg in sys.argv[1:]:
try:
f = open(arg, 'r')
except OSError:
print('cannot open', arg)
else:
print(arg, 'has', len(f.readlines()), 'lines')
f.close()
The use of the else clause is better than adding additional code to the try clause because it avoids accidentally catching
an exception that wasn’t raised by the code being protected by the try … except statement.
Exception handlers do not handle only exceptions that occur immediately in the try clause, but also those that occur inside
functions that are called (even indirectly) in the try clause. For example:
>>> def this_fails():
... x = 1/0
...
>>> try:
... this_fails()
... except ZeroDivisionError as err:
... print('Handling run-time error:', err)
...
Handling run-time error: division by zero
The raise statement allows the programmer to force a specified exception to occur. For example:
>>> raise NameError('HiThere')
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
NameError: HiThere
The sole argument to raise indicates the exception to be raised. This must be either an exception instance or an exception
class (a class that derives from BaseException, such as Exception or one of its subclasses). If an exception class
is passed, it will be implicitly instantiated by calling its constructor with no arguments:
raise ValueError # shorthand for 'raise ValueError()'
If you need to determine whether an exception was raised but don’t intend to handle it, a simpler form of the raise
statement allows you to re-raise the exception:
>>> try:
... raise NameError('HiThere')
... except NameError:
... print('An exception flew by!')
... raise
...
An exception flew by!
Traceback (most recent call last):
File "<stdin>", line 2, in <module>
NameError: HiThere
If an unhandled exception occurs inside an except section, it will have the exception being handled attached to it and
included in the error message:
>>> try:
... open("database.sqlite")
... except OSError:
... raise RuntimeError("unable to handle error")
...
Traceback (most recent call last):
File "<stdin>", line 2, in <module>
FileNotFoundError: [Errno 2] No such file or directory: 'database.sqlite'
To indicate that an exception is a direct consequence of another, the raise statement allows an optional from clause:
This can be useful when you are transforming exceptions. For example:
The above exception was the direct cause of the following exception:
It also allows disabling automatic exception chaining using the from None idiom:
>>> try:
... open('database.sqlite')
... except OSError:
... raise RuntimeError from None
...
Traceback (most recent call last):
File "<stdin>", line 4, in <module>
RuntimeError
Programs may name their own exceptions by creating a new exception class (see Classes for more about Python classes).
Exceptions should typically be derived from the Exception class, either directly or indirectly.
Exception classes can be defined which do anything any other class can do, but are usually kept simple, often only offering
a number of attributes that allow information about the error to be extracted by handlers for the exception.
Most exceptions are defined with names that end in “Error”, similar to the naming of the standard exceptions.
Many standard modules define their own exceptions to report errors that may occur in functions they define.
The try statement has another optional clause which is intended to define clean-up actions that must be executed under
all circumstances. For example:
>>> try:
... raise KeyboardInterrupt
... finally:
... print('Goodbye, world!')
...
Goodbye, world!
Traceback (most recent call last):
File "<stdin>", line 2, in <module>
KeyboardInterrupt
If a finally clause is present, the finally clause will execute as the last task before the try statement completes.
The finally clause runs whether or not the try statement produces an exception. The following points discuss more
complex cases when an exception occurs:
• If an exception occurs during execution of the try clause, the exception may be handled by an except clause. If
the exception is not handled by an except clause, the exception is re-raised after the finally clause has been
executed.
• An exception could occur during execution of an except or else clause. Again, the exception is re-raised after
the finally clause has been executed.
• If the finally clause executes a break, continue or return statement, exceptions are not re-raised.
• If the try statement reaches a break, continue or return statement, the finally clause will execute just
prior to the break, continue or return statement’s execution.
• If a finally clause includes a return statement, the returned value will be the one from the finally clause’s
return statement, not the value from the try clause’s return statement.
For example:
As you can see, the finally clause is executed in any event. The TypeError raised by dividing two strings is not
handled by the except clause and therefore re-raised after the finally clause has been executed.
In real world applications, the finally clause is useful for releasing external resources (such as files or network con-
nections), regardless of whether the use of the resource was successful.
Some objects define standard clean-up actions to be undertaken when the object is no longer needed, regardless of whether
or not the operation using the object succeeded or failed. Look at the following example, which tries to open a file and
print its contents to the screen.
The problem with this code is that it leaves the file open for an indeterminate amount of time after this part of the code
has finished executing. This is not an issue in simple scripts, but can be a problem for larger applications. The with
statement allows objects like files to be used in a way that ensures they are always cleaned up promptly and correctly.
with open("myfile.txt") as f:
for line in f:
print(line, end="")
After the statement is executed, the file f is always closed, even if a problem was encountered while processing the lines.
Objects which, like files, provide predefined clean-up actions will indicate this in their documentation.
There are situations where it is necessary to report several exceptions that have occurred. This is often the case in
concurrency frameworks, when several tasks may have failed in parallel, but there are also other use cases where it is
desirable to continue execution and collect multiple errors rather than raise the first exception.
The builtin ExceptionGroup wraps a list of exception instances so that they can be raised together. It is an exception
itself, so it can be caught like any other exception.
By using except* instead of except, we can selectively handle only the exceptions in the group that match a certain
type. In the following example, which shows a nested exception group, each except* clause extracts from the group
exceptions of a certain type while letting all other exceptions propagate to other clauses and eventually to be reraised.