I am a PhD student at Max Planck Institute - Intelligent Systems, working with Manuel Gomez-Rodriguez and Bernhard Schölkopf. I am interested in Machine Learning and Social Netwroks.
Python code is usually stored in text files with the file ending “.py”:
myprogram.py
Every line in a Python program file is assumed to be a Python statement, or part thereof.
The only exception is comment lines, which start with the character # (optionally preceded by an arbitrary number of white-space characters, i.e., tabs or spaces). Comment lines are usually ignored by the Python interpreter.
To run our Python program from the command line we use:
$ python myprogram.py
On UNIX systems it is common to define the path to the interpreter on the first line of the program (note that this is a comment line as far as the Python interpreter is concerned):
#!/usr/bin/env python
If we do, and if we additionally set the file script to be executable, we can run the program like this:
The standard character encoding is ASCII, but we can use any other encoding, for example UTF-8. To specify that UTF-8 is used we include the special line
Other than these two optional lines in the beginning of a Python code file, no additional code is required for initializing a program.
IPython notebooks
This file - an IPython notebook - does not follow the standard pattern with Python code in a text file. Instead, an IPython notebook is stored as a file in the JSON format. The advantage is that we can mix formatted text, Python code and code output. It requires the IPython notebook server to run it though, and therefore isn’t a stand-alone Python program as described above. Other than that, there is no difference between the Python code that goes into a program file or an IPython notebook.
Modules
Most of the functionality in Python is provided by modules. The Python Standard Library is a large collection of modules that provides cross-platform implementations of common facilities such as access to the operating system, file I/O, string management, network communication, and much more.
References
The Python Language Reference: http://docs.python.org/2/reference/index.html
The Python Standard Library: http://docs.python.org/2/library/
To use a module in a Python program it first has to be imported. A module can be imported using the import statement. For example, to import the module math, which contains many standard mathematical functions, we can do:
1
importmath
This includes the whole module and makes it available for use later in the program. For example, we can do:
1
2
3
4
5
importmathx=math.cos(2*math.pi)print(x)
1.0
Alternatively, we can chose to import all symbols (functions and variables) in a module to the current namespace (so that we don’t need to use the prefix “math.” every time we use something from the math module:
1
2
3
4
5
frommathimport*x=cos(2*pi)print(x)
1.0
This pattern can be very convenient, but in large programs that include many modules it is often a good idea to keep the symbols from each module in their own namespaces, by using the import math pattern. This would elminate potentially confusing problems with name space collisions.
As a third alternative, we can chose to import only a few selected symbols from a module by explicitly listing which ones we want to import instead of using the wildcard character *:
1
2
3
4
5
frommathimportcos,pix=cos(2*pi)print(x)
1.0
Finally, one standard practice in the scientific community using Python is to shorten namespaces. This usually works my specifiying two characters or three characters nicknames.
And using the function help we can get a description of each function (almost .. not all functions have docstrings, as they are technically called, but the vast majority of functions are documented this way).
1
help(math.log)
Help on built-in function log in module math:
log(...)
log(x[, base])
Return the logarithm of x to the given base.
If the base not specified, returns the natural logarithm (base e) of x.
1
log(10)
2.302585092994046
1
log(10,2)
3.3219280948873626
We can also use the help function directly on modules: Try
help(math)
Some very useful modules form the Python standard library are os, sys, math, shutil, re, subprocess, multiprocessing, threading.
A complete lists of standard modules for Python 2 and Python 3 are available at http://docs.python.org/2/library/ and http://docs.python.org/3/library/, respectively.
Variables and types
Symbol names
Variable names in Python can contain alphanumerical characters a-z, A-Z, 0-9 and some special characters such as _. Normal variable names must start with a letter.
By convension, variable names start with a lower-case letter, and Class names start with a capital letter.
In addition, there are a number of Python keywords that cannot be used as variable names. These keywords are:
and, as, assert, break, class, continue, def, del, elif, else, except,
exec, finally, for, from, global, if, import, in, is, lambda, not, or,
pass, print, raise, return, try, while, with, yield
Note: Be aware of the keyword lambda, which could easily be a natural variable name in a scientific program. But being a keyword, it cannot be used as a variable name.
Assignment
The assignment operator in Python is =. Python is a dynamically typed language, so we do not need to specify the type of a variable when we create one.
Assigning a value to a new variable creates the variable:
1
2
3
# variable assignments
x=1.0my_variable=12.2
Although not explicitly specified, a variable do have a type associated with it. The type is derived form the value it was assigned.
1
type(x)
float
If we assign a new value to a variable, its type can change.
1
x=1
1
type(x)
int
If we try to use a variable that has not yet been defined we get an NameError:
1
print(y)
---------------------------------------------------------------------------
NameError Traceback (most recent call last)
<ipython-input-19-36b2093251cd> in <module>()
----> 1 print(y)
NameError: name 'y' is not defined
Fundamental types
1
2
3
# integers
x=1type(x)
int
1
2
3
# float
x=1.0type(x)
float
1
2
3
4
5
# boolean
b1=Trueb2=Falsetype(b1)
bool
1
2
3
# complex numbers: note the use of `j` to specify the imaginary part
x=1.0-1.0jtype(x)
complex
1
print(x)
(1-1j)
1
print(x.real,x.imag)
(1.0, -1.0)
Type utility functions
The module types contains a number of type name definitions that can be used to test if variables are of certain types:
1
2
3
4
importtypes# print all types defined in the `types` module
print(dir(types))
# Note! The power operators in python isn't ^, but **
2**2
4
The boolean operators are spelled out as words and, not, or.
1
TrueandFalse
False
1
notFalse
True
1
TrueorFalse
True
Comparison operators >, <, >= (greater or equal), <= (less or equal), == equality, is identical.
1
2>1,2<1
(True, False)
1
2>2,2<2
(False, False)
1
2>=2,2<=2
(True, True)
1
2
# equality
[1,2]==[1,2]
True
1
2
3
4
# objects identical?
l1=l2=[1,2]l1isl2
True
Compound types: Strings, List and dictionaries
Strings
Strings are the variable type that is used for storing text messages.
1
2
s="Hello world"type(s)
str
1
2
# length of the string: the number of characters
len(s)
11
1
2
3
# replace a substring in a string with somethign else
s2=s.replace("world","test")print(s2)
Hello test
We can index a character in a string using []:
1
s[0]
'H'
Heads up MATLAB users: Indexing start at 0!
We can extract a part of a string using the syntax [start:stop], which extracts characters between index start and stop:
1
s[0:5]
'Hello'
If we omit either (or both) of start or stop from [start:stop], the default is the beginning and the end of the string, respectively:
1
s[:5]
'Hello'
1
s[6:]
'world'
1
s[:]
'Hello world'
We can also define the step size using the syntax [start:end:step] (the default value for step is 1, as we saw above):
1
s[::1]
'Hello world'
1
s[::2]
'Hlowrd'
This technique is called slicing. Read more about the syntax here: http://docs.python.org/release/2.7.3/library/functions.html?highlight=slice#slice
Python has a very rich set of functions for text processing. See for example http://docs.python.org/2/library/string.html for more information.
String formatting examples
1
print("str1","str2","str3")# The print statement concatenates strings with a space
('str1', 'str2', 'str3')
1
print("str1",1.0,False,-1j)# The print statements converts all arguments to strings
('str1', 1.0, False, -1j)
1
print("str1"+"str2"+"str3")# strings added with + are concatenated without space
str1str2str3
1
print("value = %f"%1.0)# we can use C-style string formatting
value = 1.000000
1
2
3
4
# this formatting creates a string
s2="value1 = %.2f. value2 = %d"%(3.1415,1.5)print(s2)
value1 = 3.14. value2 = 1
1
2
3
4
# alternative, more intuitive way of formatting a string
s3='value1 = {0}, value2 = {1}'.format(3.1415,1.5)print(s3)
value1 = 3.1415, value2 = 1.5
List
Lists are very similar to strings, except that each element can be of any type.
The syntax for creating lists in Python is [...]:
1
2
3
4
l=[1,2,3,4]print(type(l))print(l)
<type 'list'>
[1, 2, 3, 4]
We can use the same slicing techniques to manipulate lists as we could use on strings:
1
2
3
4
5
print(l)print(l[1:3])print(l[::2])
[1, 2, 3, 4]
[2, 3]
[1, 3]
Heads up MATLAB users: Indexing starts at 0!
1
l[0]
1
Elements in a list do not all have to be of the same type:
1
2
3
l=[1,'a',1.0,1-1j]print(l)
[1, 'a', 1.0, (1-1j)]
Python lists can be inhomogeneous and arbitrarily nested:
1
2
3
nested_list=[1,[2,[3,[4,[5]]]]]nested_list
[1, [2, [3, [4, [5]]]]]
Lists play a very important role in Python, and are for example used in loops and other flow control structures (discussed below). There are number of convenient functions for generating lists of various types, for example the range function:
1
2
3
4
5
start=10stop=30step=2range(start,stop,step)
[10, 12, 14, 16, 18, 20, 22, 24, 26, 28]
1
2
3
# in python 3 range generates an interator, which can be converted to a list using 'list(...)'.
# It has no effect in python 2
list(range(start,stop,step))
params["parameter1"]="A"params["parameter2"]="B"# add a new entry
params["parameter4"]="D"print("parameter1 = "+str(params["parameter1"]))print("parameter2 = "+str(params["parameter2"]))print("parameter3 = "+str(params["parameter3"]))print("parameter4 = "+str(params["parameter4"]))
parameter1 = A
parameter2 = B
parameter3 = 3.0
parameter4 = D
Control Flow
Conditional statements: if, elif, else
The Python syntax for conditional execution of code use the keywords if, elif (else if), else:
1
2
3
4
5
6
7
8
9
10
11
statement1=Falsestatement2=Falseifstatement1:print("statement1 is True")elifstatement2:print("statement2 is True")else:print("statement1 and statement2 are False")
statement1 and statement2 are False
For the first time, here we encounted a peculiar and unusual aspect of the Python programming language: Program blocks are defined by their indentation level.
Compare to the equivalent C code:
if (statement1)
{
printf("statement1 is True\n");
}
else if (statement2)
{
printf("statement2 is True\n");
}
else
{
printf("statement1 and statement2 are False\n");
}
In C blocks are defined by the enclosing curly brakets { and }. And the level of indentation (white space before the code statements) does not matter (completely optional).
But in Python, the extent of a code block is defined by the indentation level (usually a tab or say four white spaces). This means that we have to be careful to indent our code correctly, or else we will get syntax errors.
Examples:
1
2
3
4
5
statement1=statement2=Trueifstatement1:ifstatement2:print("both statement1 and statement2 are True")
both statement1 and statement2 are True
1
2
3
4
# Bad indentation!
ifstatement1:ifstatement2:print("both statement1 and statement2 are True")# this line is not properly indented
File "<ipython-input-90-78979cdecf37>", line 4
print("both statement1 and statement2 are True") # this line is not properly indented
^
IndentationError: expected an indented block
1
2
3
4
5
6
statement1=Falseifstatement1:print("printed if statement1 is True")print("still inside the if block")
1
2
3
4
ifstatement1:print("printed if statement1 is True")print("now outside the if block")
now outside the if block
Loops
In Python, loops can be programmed in a number of different ways. The most common is the for loop, which is used together with iterable objects, such as lists. The basic syntax is:
for loops:
1
2
forxin[1,2,3]:print(x)
1
2
3
The for loop iterates over the elements of the supplied list, and executes the containing block once for each element. Any kind of list can be used in the for loop. For example:
1
2
forxinrange(4):# by default range start at 0
print(x)
parameter4 = D
parameter1 = A
parameter3 = 3.0
parameter2 = B
Sometimes it is useful to have access to the indices of the values when iterating over a list. We can use the enumerate function for this:
1
2
foridx,xinenumerate(range(-3,3)):print(idx,x)
(0, -3)
(1, -2)
(2, -1)
(3, 0)
(4, 1)
(5, 2)
List comprehensions: Creating lists using for loops:
A convenient and compact way to initialize lists:
1
2
3
l1=[x**2forxinrange(0,5)]print(l1)
[0, 1, 4, 9, 16]
while loops:
1
2
3
4
5
6
7
8
i=0whilei<5:print(i)i=i+1print("done")
0
1
2
3
4
done
Note that the print("done") statement is not part of the while loop body because of the difference in indentation.
Functions
A function in Python is defined using the keyword def, followed by a function name, a signature within parentheses (), and a colon :. The following code, with one additional level of indentation, is the function body.
1
2
deffunc0():print("test")
1
func0()
test
Optionally, but highly recommended, we can define a so called “docstring”, which is a description of the functions purpose and behaivor. The docstring should follow directly after the function definition, before the code in the function body.
1
2
3
4
5
6
deffunc1(s):"""
Print a string 's' and tell how many characters it has
"""print(s+" has "+str(len(s))+" characters")
1
help(func1)
Help on function func1 in module __main__:
func1(s)
Print a string 's' and tell how many characters it has
1
func1("test")
test has 4 characters
Functions that returns a value use the return keyword:
1
2
3
4
5
defsquare(x):"""
Return the square of x.
"""returnx**2
1
square(4)
16
We can return multiple values from a function using tuples (see above):
1
2
3
4
5
defpowers(x):"""
Return a few powers of x.
"""returnx**2,x**3,x**4
1
powers(3)
(9, 27, 81)
1
2
3
x2,x3,x4=powers(3)print(x3)
27
Default argument and keyword arguments
In a definition of a function, we can give default values to the arguments the function takes:
1
2
3
4
defmyfunc(x,p=2,debug=False):ifdebug:print("evaluating myfunc for x = "+str(x)+" using exponent p = "+str(p))returnx**p
If we don’t provide a value of the debug argument when calling the the function myfunc it defaults to the value provided in the function definition:
1
myfunc(5)
25
1
myfunc(5,debug=True)
evaluating myfunc for x = 5 using exponent p = 2
25
If we explicitly list the name of the arguments in the function calls, they do not need to come in the same order as in the function definition. This is called keyword arguments, and is often very useful in functions that takes a lot of optional arguments.
1
myfunc(p=3,debug=True,x=7)
evaluating myfunc for x = 7 using exponent p = 3
343
Unnamed functions (lambda function)
In Python we can also create unnamed functions, using the lambda keyword:
1
2
3
4
5
6
f1=lambdax:x**2# is equivalent to
deff2(x):returnx**2
1
f1(2),f2(2)
(4, 4)
This technique is useful for example when we want to pass a simple function as an argument to another function, like this:
1
2
# map is a built-in python function
map(lambdax:x**2,range(-3,4))
[9, 4, 1, 0, 1, 4, 9]
1
2
# in python 3 we can use `list(...)` to convert the iterator to an explicit list
list(map(lambdax:x**2,range(-3,4)))
[9, 4, 1, 0, 1, 4, 9]
Classes
Classes are the key features of object-oriented programming. A class is a structure for representing an object and the operations that can be performed on the object.
In Python a class can contain attributes (variables) and methods (functions).
A class is defined almost like a function, but using the class keyword, and the class definition usually contains a number of class method definitions (a function in a class).
Each class method should have an argument self as it first argument. This object is a self-reference.
Some class method names have special meaning, for example:
__init__: The name of the method that is invoked when the object is first created.
__str__ : A method that is invoked when a simple string representation of the class is needed, as for example when printed.
There are many more, see http://docs.python.org/2/reference/datamodel.html#special-method-names
classPoint:"""
Simple class for representing a point in a Cartesian coordinate system.
"""def__init__(self,x,y):"""
Create a new Point at x, y.
"""self.x=xself.y=ydeftranslate(self,dx,dy):"""
Translate the point by dx and dy in the x and y direction.
"""self.x+=dxself.y+=dydef__str__(self):return("Point at [%f, %f]"%(self.x,self.y))
To create a new instance of a class:
1
2
3
p1=Point(0,0)# this will invoke the __init__ method in the Point class
print(p1)# this will invoke the __str__ method
Point at [0.250000, 1.500000]
Point at [1.000000, 1.000000]
Note that calling class methods can modifiy the state of that particular class instance, but does not effect other class instances or any global variables.
That is one of the nice things about object-oriented design: code such as functions and related variables are grouped in separate and independent entities.
Modules
One of the most important concepts in good programming is to reuse code and avoid repetitions.
The idea is to write functions and classes with a well-defined purpose and scope, and reuse these instead of repeating similar code in different part of a program (modular programming). The result is usually that readability and maintainability of a program is greatly improved. What this means in practice is that our programs have fewer bugs, are easier to extend and debug/troubleshoot.
Python supports modular programming at different levels. Functions and classes are examples of tools for low-level modular programming. Python modules are a higher-level modular programming construct, where we can collect related variables, functions and classes in a module. A python module is defined in a python file (with file-ending .py), and it can be made accessible to other Python modules and programs using the import statement.
Consider the following example: the file mymodule.py contains simple example implementations of a variable, function and a class:
%%filemymodule.py"""
Example of a python module. Contains a variable called my_variable,
a function called my_function, and a class called MyClass.
"""my_variable=0defmy_function():"""
Example function
"""returnmy_variableclassMyClass:"""
Example class.
"""def__init__(self):self.variable=my_variabledefset_variable(self,new_value):"""
Set self.variable to a new value
"""self.variable=new_valuedefget_variable(self):returnself.variable
Writing mymodule.py
We can import the module mymodule into our Python program using import:
1
importmymodule
Use help(module) to get a summary of what the module provides:
1
help(mymodule)
Help on module mymodule:
NAME
mymodule
FILE
/is/ei/btabibian/Documents/scientific-python-lectures/mymodule.py
DESCRIPTION
Example of a python module. Contains a variable called my_variable,
a function called my_function, and a class called MyClass.
CLASSES
MyClass
class MyClass
| Example class.
|
| Methods defined here:
|
| __init__(self)
|
| get_variable(self)
|
| set_variable(self, new_value)
| Set self.variable to a new value
FUNCTIONS
my_function()
Example function
DATA
my_variable = 0
If we make changes to the code in mymodule.py, we need to reload it using reload:
1
reload(mymodule)# works only in python 2
<module 'mymodule' from 'mymodule.pyc'>
Exceptions
In Python errors are managed with a special language construct called “Exceptions”. When errors occur exceptions can be raised, which interrupts the normal program flow and fallback to somewhere else in the code where the closest try-except statement is defined.
To generate an exception we can use the raise statement, which takes an argument that must be an instance of the class BaseException or a class derived from it.
1
raiseException("description of the error")
---------------------------------------------------------------------------
Exception Traceback (most recent call last)
<ipython-input-129-8f47ba831d5a> in <module>()
----> 1 raise Exception("description of the error")
Exception: description of the error
A typical use of exceptions is to abort functions when some error condition occurs, for example:
def my_function(arguments):
if not verify(arguments):
raise Exception("Invalid arguments")
# rest of the code goes here
To gracefully catch errors that are generated by functions and class methods, or by the Python interpreter itself, use the try and except statements:
try:
# normal code goes here
except:
# code for error handling goes here
# this code is not executed unless the code
# above generated an error
For example:
1
2
3
4
5
6
try:print("test")# generate an error: the variable test is not defined
print(test)except:print("Caught an exception")
test
Caught an exception
To get information about the error, we can access the Exception class instance that describes the exception by using for example:
except Exception as e:
1
2
3
4
5
6
try:print("test")# generate an error: the variable test is not defined
print(test)exceptExceptionase:print("Caught an exception:"+str(e))
test
Caught an exception:name 'test' is not defined
Further reading
http://www.python.org - The official web page of the Python programming language.
http://www.python.org/dev/peps/pep-0008 - Style guide for Python programming. Highly recommended.
http://www.greenteapress.com/thinkpython/ - A free book on Python programming.
Internet is very dear to me! That may sound strange but I come from a generation where I understand the difference between Internet and no Internet. I can st...