Basic Data Types

Constant values by type

Value: Hexadecimal a1
Python Expression: 0xa1

Value: a string made up of the following characters, in order

  • Horizontal Tab character
  • Newline (ASCII Linefeed) character
  • The character with hexadecimal value a0

Python Expression: any of the following

  • '\t\n\xa0'
  • "\t\n\xa0"
  • '''\t\n\xa0'''
  • """\t\n\xa0"""

Basic Operators

Python Arithmetic Operators

+ Addition: Adds values on either side of the operator. Example: 10 + 20 = 30
- Subtraction: Subtracts right hand operand from left hand operand. Example: 10 - 20 = -10
* Multiplication: Multiplies values on either side of the operator. Example: 10 * 20 = 200
/ Division: Divides left hand operand by right hand operand. Example: 20 / 10 = 2.0 in Python 3 (2 in Python 2)
% Modulus: Divides left hand operand by right hand operand and returns the remainder. Example: 20 % 10 = 0
** Exponent: Performs exponential (power) calculation on the operands. Example: 10 ** 20 = 10 to the power 20
// Floor Division: Divides the operands and rounds the quotient down to a whole number. For positive operands this simply drops the digits after the decimal point. If one of the operands is negative, the result is floored, i.e., rounded away from zero (towards negative infinity). Examples:

9//2 = 4 and 9.0//2.0 = 4.0

-11//3 = -4

-11.0//3 = -4.0
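
The flooring rule is easiest to see next to truncation toward zero. The following is a small sketch; the variable names are illustrative only and do not come from the original page.

```python
import math

# Floor division rounds toward negative infinity, not toward zero.
pos = 9 // 2             # 4
neg = -11 // 3           # -4, because -3.67 is floored to -4
neg_float = -11.0 // 3   # -4.0 (a float operand gives a float result)

# Truncation toward zero gives a different answer for negative operands.
truncated = math.trunc(-11 / 3)   # -3

# divmod returns (quotient, remainder) such that
# quotient * divisor + remainder == dividend.
quotient, remainder = divmod(-11, 3)

print(pos, neg, neg_float, truncated, quotient, remainder)
```

The remainder returned by % (and divmod) takes the sign of the divisor, which is why -11 % 3 is 1 rather than -2.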

Python Comparison Operators

The examples below assume a = 10 and b = 20.

== If the values of two operands are equal, then the condition becomes true. (a == b) is not true.
!= If the values of two operands are not equal, then the condition becomes true. (a != b) is true.
<> If the values of two operands are not equal, then the condition becomes true. (a <> b) is true. This is equivalent to the != operator, but it exists in Python 2 only and was removed in Python 3.
> If the value of the left operand is greater than the value of the right operand, then the condition becomes true. (a > b) is not true.
< If the value of the left operand is less than the value of the right operand, then the condition becomes true. (a < b) is true.
>= If the value of the left operand is greater than or equal to the value of the right operand, then the condition becomes true. (a >= b) is not true.
<= If the value of the left operand is less than or equal to the value of the right operand, then the condition becomes true. (a <= b) is true.

Data Structures - List / Set / Tuple / Dictionary


list1 = ['physics', 'chemistry', 1997, 2000]
list2 = [1, 2, 3, 4, 5]
list3 = ["a", "b", "c", "d"]

Split string as list

sentence = "the quick brown fox jumps over the lazy dog"
words = sentence.split()

Filter positive numbers only - 1

numbers = [34.6, -203.4, 44.9, 68.3, -12.2, 44.6, 12.7]
newlist = []
for number in numbers:
    if number>0:
        # keep only the positive values
        newlist.append(number)

Filter positive numbers only - 2

numbers = [34.6, -203.4, 44.9, 68.3, -12.2, 44.6, 12.7]
newlist = [int(x) for x in numbers if x > 0]

Create word list from a sentence with no duplicate entries

set() removes all the duplicate entries in the array

strings = "my name is Chun Kang and Chun is my name"
r = set(strings.split())

Python List REPL sessions

a = ['foo', 'bar', 'baz', 'qux', 'quux', 'corge']

print(a[:] is a)                   # False: a full slice creates a new list object
print(max(a[2:4] + ['grault']))    # 'qux': strings are compared alphabetically

Diagram for the list indices:



['bar', 'baz']
['quux', 'baz', 'foo']


Tuples are immutable: unlike lists, they cannot be changed after creation. Tuples are written with parentheses, whereas lists use square brackets.

tup1 = ('physics', 'chemistry', 1997, 2000)
tup2 = (1, 2, 3, 4, 5)
tup3 = "a", "b", "c", "d"


Unordered collections of unique elements

Set(['Jane', 'Marvin', 'Janice', 'John', 'Jack'])
Set(['Janice', 'Jack', 'Sam'])
Set(['Jane', 'Zack', 'Jack'])
Set(['Jack', 'Sam', 'Jane', 'Marvin', 'Janice', 'John', 'Zack'])

Find overlapped entries from two arrays

a = set([ "Seoul", "Pusan", "Incheon", "Mokpo" ])
b = set([ "Seoul", "Incheon", "Suwon", "Daejeon", "Gwangjoo", "Taeku"])


The result will be like below


{'Seoul', 'Incheon'}

{'Seoul', 'Incheon'}
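
The listing above defines the two sets, but the call that produces the result is not shown. A minimal sketch, assuming the intended call is the set method intersection (the names overlap and same_overlap are illustrative):

```python
a = {"Seoul", "Pusan", "Incheon", "Mokpo"}
b = {"Seoul", "Incheon", "Suwon", "Daejeon", "Gwangjoo", "Taeku"}

# Elements present in both sets; the operation is symmetric,
# so a.intersection(b) and b.intersection(a) give the same set.
overlap = a.intersection(b)
same_overlap = b & a   # operator form of the same operation

print(overlap)
print(same_overlap)
```

Sets are unordered, so the elements may print in a different order from run to run.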

Find different elements from two arrays based on "symmetric_difference" method

a = set(["Jake", "John", "Eric"])
b = set(["John", "Jill"])


The result will be like below


{'Jake', 'Eric', 'Jill'}

{'Eric', 'Jake', 'Jill'}

Find different elements from two arrays based on "difference" method

a = set(["Jake", "John", "Eric"])
b = set(["John", "Jill"])


The result will be like below


{'Jake', 'Eric'}


Find different elements from two arrays based on "union" method

a = set(["Jake", "John", "Eric"])
b = set(["John", "Jill"])


The result will be like below


{'John', 'Eric', 'Jake', 'Jill'}
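
The three listings above show the inputs and the results but not the calls themselves. The sketch below assumes the calls are the set methods named in each heading; the result variable names are illustrative.

```python
a = {"Jake", "John", "Eric"}
b = {"John", "Jill"}

# Elements in exactly one of the two sets.
in_one_only = a.symmetric_difference(b)

# Elements of a that are not in b (not symmetric: b.difference(a) differs).
only_in_a = a.difference(b)
only_in_b = b.difference(a)

# All elements from both sets, without duplicates.
everyone = a.union(b)

print(sorted(in_one_only))
print(sorted(only_in_a), sorted(only_in_b))
print(sorted(everyone))
```

Note that union combines the two sets rather than finding differing elements, so it returns every name exactly once.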

Print out a set containing all the participants from event A which did not attend event B

a = ["Jake", "John", "Eric"]
b = ["John", "Jill"]
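
The lists above are given without the code that answers the question. One possible approach, sketched here under the assumption that converting the lists to sets is acceptable (the name only_a is illustrative):

```python
a = ["Jake", "John", "Eric"]
b = ["John", "Jill"]

# Convert both lists to sets, then keep the members of a that are absent from b.
only_a = set(a).difference(set(b))

print(only_a)
```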


Find sorted unique names in two list

def unique_names(names1, names2):
    return sorted(set(names1+names2))

names1 = ["Ava", "Emma", "Olivia"]
names2 = ["Olivia", "Sophia", "Emma"]
print(unique_names(names1, names2)) # should print Ava, Emma, Olivia, Sophia


Python dictionaries are similar to lists in that they are mutable and can be nested to any arbitrary depth (constrained only by available memory).

A dictionary can contain any type of Python object, including another dictionary. The keys in a given dictionary do not need to be the same type as one another, nor do the values.

Dictionary elements are accessed by key. Unlike with list indexing, the order of the items in a dictionary plays no role in how the items are accessed.

Even though dictionary access does not rely on item order, as of version 3.7 the Python language specification does guarantee that the order of items in a dictionary is maintained once the dictionary is created.

# named student rather than dict so the built-in dict() stays usable
student = {'Name': 'Zara', 'Age': 7, 'Class': 'First'}

Get last name from full name by split()

The function can be implemented with the string method split(), which breaks the full name on whitespace.

actor = {"name": "John Cleese", "rank": "awesome"}

def get_last_name():
    return actor["name"].split()[1]

print("All exceptions caught! Good job!")
print("The actor's last name is %s" % get_last_name())

Accessing dictionary values

x = {
    'foo': 1,
    'bar': {    # the key of this nested dictionary is missing in the source; 'bar' is an assumption
        'x': 10,
        'y': 20,
        'z': 30
    },
    'baz': 3
}




Delete a dictionary element

Deleting a dictionary element by statement

del d['foo']

Deleting a dictionary element by method


Copying a dictionary

Method 1)

d2 = dict(d1)

Method 2)

d2 = dict(d1.items())

Method 3)

d2 = {}
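
Method 3 stops after creating the empty dictionary. The sketch below assumes the intended next step is dict.update and compares it with the other approaches; d1 and its contents are hypothetical. All three methods produce shallow copies, so nested objects remain shared with the original.

```python
import copy

d1 = {"a": 1, "nested": [1, 2]}   # hypothetical example data

# Method 3, completed with update (an assumption about the missing line).
d2 = {}
d2.update(d1)

d3 = dict(d1)             # Method 1
d4 = dict(d1.items())     # Method 2
deep = copy.deepcopy(d1)  # fully independent copy, for comparison

# Mutating a nested object is visible through every shallow copy.
d1["nested"].append(3)

print(d2["nested"], d3["nested"], d4["nested"], deep["nested"])
```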


Random number generation

import random

def lottery():
    # returns 6 numbers between 1 and 40
    for i in range(6):
        yield random.randint(1, 40)

    # returns a 7th number between 1 and 15
    yield random.randint(1,15)

for random_number in lottery():
       print("And the next number is... %d!" %(random_number))

Swap variables' value

a = 1
b = 2
a, b = b, a

Fibonacci series generator

The first two numbers of the series are always equal to 1, and each subsequent number is the sum of the previous two. The code below needs only two variables to produce the series.

def fib():
    a, b = 1, 1
    while 1:
        yield a
        a, b = b, a + b

# testing code
import types
if type(fib()) == types.GeneratorType:
    print("Good, The fib function is a generator.")

    counter = 0
    for n in fib():
        print(n)
        counter += 1
        if counter == 10:
            # fib() never ends on its own, so stop after 10 numbers
            break

Function Arguments(Parameters)

Multiple Function Argument recognition - the list of "therest" parameters

def foo(first, second, third, *therest):
    print("First: %s" %(first))
    print("Second: %s" %(second))
    print("Third: %s" %(third))
    print("And all the rest... %s" %(list(therest)))


Multiple Function Argument by keyword

def bar(first, second, third, **options):
    if options.get("action") == "sum":
        print("The sum is: %d" %(first + second + third))

    if options.get("number") == "first":
        return first

result = bar(1, 2, 3, action = "sum", number = "first")
print("Result: %d" %(result))

Regular Expression

RegEx(Regular Expressions) to search "[on]" or "[off]" on the string

import re

pattern = re.compile(r"\[(on|off)\]") # Slight optimization
# re.search returns a match object if the pattern is found, otherwise None
print(re.search(pattern, "Mono: Playback 65 [75%] [-16.50dB] [on]"))

RegEx(Regular Expression) to check email address

import re

def test_email(your_pattern):
    pattern = re.compile(your_pattern)
    emails = ["", "", "wha.t.`1an?ug{}"]
    for email in emails:
        if not re.match(pattern, email):
            print("You failed to match %s" % (email))
        elif not your_pattern:
            print("Forgot to enter a pattern!")

pattern = r"[a-z0-9]+@[a-z0-9]+\.[a-z0-9]+"
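
The listing above never calls test_email, and two of its sample addresses are blank. The following sketch shows how the pattern behaves; the helper find_unmatched and the example.com / example.org addresses are hypothetical and not part of the original page.

```python
import re

pattern = r"[a-z0-9]+@[a-z0-9]+\.[a-z0-9]+"

def find_unmatched(your_pattern, emails):
    """Return the entries that the pattern fails to match from the start."""
    compiled = re.compile(your_pattern)
    return [email for email in emails if not compiled.match(email)]

# Hypothetical sample data: two well-formed addresses and one malformed string.
emails = ["john@example.com", "jane@example.org", "wha.t.`1an?ug{}"]
unmatched = find_unmatched(pattern, emails)

print(unmatched)
```

This pattern is deliberately simple: it rejects uppercase letters, dots, and hyphens in the local part, so it is a teaching example rather than a general email validator.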

Exception Handling

try/except block

def do_stuff_with_number(n):
    print(n)

def catch_this():
    the_list = (1, 2, 3, 4, 5)

    for i in range(20):
        try:
            do_stuff_with_number(the_list[i])
        except IndexError: # Raised when accessing a non-existing index of a list
            do_stuff_with_number('out of bound - %d' % i)

catch_this()



Convert lists to NumPy arrays

# Create 2 new lists height and weight
height = [1.87,  1.87, 1.82, 1.91, 1.90, 1.85]
weight = [81.65, 97.52, 95.25, 92.98, 86.18, 88.45]

# Import the numpy package as np
import numpy as np

# Create 2 numpy arrays from height and weight
np_height = np.array(height)
np_weight = np.array(weight)

# Print the type of np_height
print(type(np_height))

# Calculate bmi
bmi = np_weight / np_height ** 2

# Print the result
print(bmi)
# For a boolean response
print(bmi > 23)

# Print only those observations above 23
print(bmi[bmi > 23])


<class 'numpy.ndarray'>
[ 23.34925219  27.88755755  28.75558507  25.48723993  23.87257618
[ True  True  True  True  True  True]
[ 23.34925219  27.88755755  28.75558507  25.48723993  23.87257618

Convert all of the weights from kilograms to pounds using NumPy

weight_kg = [81.65, 97.52, 95.25, 92.98, 86.18, 88.45]

import numpy as np

# Create a numpy array np_weight_kg from weight_kg
np_weight_kg = np.array(weight_kg)

# Create np_weight_lbs from np_weight_kg
np_weight_lbs = np_weight_kg * 2.2

# Print out np_weight_lbs
print(np_weight_lbs)


    [ 179.63   214.544  209.55   204.556  189.596  194.59 ]

Pandas DataFrame / CSV / Join / Merge

Create a Pandas DataFrame from a dictionary of lists

# named data rather than dict so the built-in dict() is not shadowed
data = {"country": ["Brazil", "Russia", "India", "China", "South Africa"],
       "capital": ["Brasilia", "Moscow", "New Delhi", "Beijing", "Pretoria"],
       "area": [8.516, 17.10, 3.286, 9.597, 1.221],
       "population": [200.4, 143.5, 1252, 1357, 52.98] }

import pandas as pd
brics = pd.DataFrame(data)

Adding index to a Pandas DataFrame

# Set the index for brics
brics.index = ["BR", "RU", "IN", "CH", "SA"]

# Print out brics with new index values
print(brics)

Reading CSV by Pandas DataFrame

# Import pandas as pd
import pandas as pd

# Import the cars.csv data: cars
cars = pd.read_csv('cars.csv')

# Print out cars
print(cars)


Reading a CSV file by Pandas DataFrame with 1st column as index

# Import pandas and cars.csv
import pandas as pd
cars = pd.read_csv('cars.csv', index_col = 0)

# Print out country column as Pandas Series (single brackets)
print(cars['country'])

# Print out country column as Pandas DataFrame (double brackets)
print(cars[['country']])

# Print out DataFrame with cars_per_cap and country columns
print(cars[['cars_per_cap', 'country']])

Save a Pandas DataFrame in CSV format

# named data rather than dict so the built-in dict() is not shadowed
data = {"country": ["Brazil", "Russia", "India", "China", "South Africa"],
       "capital": ["Brasilia", "Moscow", "New Delhi", "Beijing", "Pretoria"],
       "area": [8.516, 17.10, 3.286, 9.597, 1.221],
       "population": [200.4, 143.5, 1252, 1357, 52.98] }

import pandas as pd
brics = pd.DataFrame(data)


Save a Pandas DataFrame in CSV format with header and no index

from pandas import DataFrame

Cars = {'Brand': ['Honda Civic','Toyota Corolla','Ford Focus','Audi A4'],
        'Price': [22000,25000,27000,35000]
        }

df = DataFrame(Cars, columns= ['Brand', 'Price'])

# index=False omits the row index; header=True writes the column names
df.to_csv(r'C:\Users\Ron\Desktop\export_dataframe.csv', index=False, header=True) #Don't forget to add '.csv' at the end of the path

print (df)

Print partial rows (observations) from a Pandas DataFrame

# Import cars data
import pandas as pd
cars = pd.read_csv('cars.csv', index_col = 0)

# Print out first 4 observations
print(cars[0:4])

# Print out fifth, sixth, and seventh observation
print(cars[4:7])

Data access by loc and iloc in Pandas DataFrame - Select columns by index or name

loc is label-based, and iloc is based on integer position.

# Import cars data
import pandas as pd
cars = pd.read_csv('cars.csv', index_col = 0)

# Print out observation for Japan (assumes its row label in cars.csv is 'JAP')
print(cars.loc['JAP'])

# Print out observations for Australia and Egypt
print(cars.loc[['AUS', 'EG']])
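
To make the difference between label-based and position-based access concrete without needing cars.csv, here is a small sketch. The DataFrame contents, including the row labels and numbers, are made up for illustration.

```python
import pandas as pd

# Hypothetical stand-in for cars.csv with country codes as the index.
cars = pd.DataFrame(
    {"country": ["United States", "Australia", "Japan"],
     "cars_per_cap": [809, 731, 588]},
    index=["US", "AUS", "JAP"],
)

# Same row, selected by label and by integer position.
by_label = cars.loc["AUS"]
by_position = cars.iloc[1]

# Same column, selected by name and by integer position.
col_by_label = cars.loc[:, "country"]
col_by_position = cars.iloc[:, 0]

print(by_label)
print(col_by_position)
```

Passing a list, as in cars.loc[["AUS"]], returns a DataFrame instead of a Series.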


Sort a Pandas DataFrame in an ascending order

# Syntax: df.sort_values(by=['Brand'], inplace=True)
# sort - ascending order
from pandas import DataFrame
Cars = {'Brand': ['Honda Civic','Toyota Corolla','Ford Focus','Audi A4'],
        'Price': [22000,25000,27000,35000],
        'Year': [2015,2013,2018,2018]
        }
df = DataFrame(Cars, columns= ['Brand', 'Price','Year'])

# sort Brand - ascending order
df.sort_values(by=['Brand'], inplace=True)

print (df)

Sort a Pandas DataFrame in a descending order

# Syntax: df.sort_values(by=['Brand'], inplace=True, ascending=False)
# sort - descending order
from pandas import DataFrame
Cars = {'Brand': ['Honda Civic','Toyota Corolla','Ford Focus','Audi A4'],
        'Price': [22000,25000,27000,35000],
        'Year': [2015,2013,2018,2018]
        }
df = DataFrame(Cars, columns= ['Brand', 'Price','Year'])

# sort Brand - descending order
df.sort_values(by=['Brand'], inplace=True, ascending=False)

print (df)

Sort a Pandas DataFrame by multiple columns

# Syntax: df.sort_values(by=['First Column','Second Column',...], inplace=True)
# sort by multiple columns
from pandas import DataFrame
Cars = {'Brand': ['Honda Civic','Toyota Corolla','Ford Focus','Audi A4'],
        'Price': [22000,25000,27000,35000],
        'Year': [2015,2013,2018,2018]
        }
df = DataFrame(Cars, columns= ['Brand', 'Price','Year'])

# sort by multiple columns: Year and Price
df.sort_values(by=['Year','Price'], inplace=True)

print (df)

Join and merge Pandas DataFrames

import pandas as pd
from IPython.display import display
from IPython.display import Image

raw_data = {
        'subject_id': ['1', '2', '3', '4', '5'],
        'first_name': ['Alex', 'Amy', 'Allen', 'Alice', 'Ayoung'], 
        'last_name': ['Anderson', 'Ackerman', 'Ali', 'Aoni', 'Atiches']}
df_a = pd.DataFrame(raw_data, columns = ['subject_id', 'first_name', 'last_name'])

raw_data = {
        'subject_id': ['4', '5', '6', '7', '8'],
        'first_name': ['Billy', 'Brian', 'Bran', 'Bryce', 'Betty'], 
        'last_name': ['Bonder', 'Black', 'Balwner', 'Brice', 'Btisan']}
df_b = pd.DataFrame(raw_data, columns = ['subject_id', 'first_name', 'last_name'])

raw_data = {
        'subject_id': ['1', '2', '3', '4', '5', '7', '8', '9', '10', '11'],
        'test_id': [51, 15, 15, 61, 16, 14, 15, 1, 61, 16]}
df_n = pd.DataFrame(raw_data, columns = ['subject_id','test_id'])

# Join the two dataframes along rows
df_new = pd.concat([df_a, df_b])

# Join the two dataframes along columns
pd.concat([df_a, df_b], axis=1)

# Merge two dataframes along the subject_id value
pd.merge(df_new, df_n, on='subject_id')

# Merge two dataframes with both the left and right dataframes using the subject_id key
pd.merge(df_new, df_n, left_on='subject_id', right_on='subject_id')

# Merge with outer join
pd.merge(df_a, df_b, on='subject_id', how='outer')

# Merge with inner join
pd.merge(df_a, df_b, on='subject_id', how='inner')

# Merge with right join
pd.merge(df_a, df_b, on='subject_id', how='right')

# Merge with left join
pd.merge(df_a, df_b, on='subject_id', how='left')

# Merge while adding a suffix to duplicate column names
pd.merge(df_a, df_b, on='subject_id', how='left', suffixes=('_left', '_right'))

# Merge based on indexes
pd.merge(df_a, df_b, right_index=True, left_index=True)

Get the maximum value of column in Pandas DataFrame

import pandas as pd
# Create a DataFrame
d = {
df = pd.DataFrame(d,columns=['Name','Age','Score'])

# get the maximum value of every column in the DataFrame - it will be raghu, 26, 89 (dtype: object)
print(df.max())

# get the maximum value of the column 'Age' - it will be 26
print(df['Age'].max())

# get the maximum value of the column 'Name' - it will be raghu
print(df['Name'].max())

Get the minimum value of column in Pandas DataFrame

import pandas as pd
# Create a DataFrame
d = {
df = pd.DataFrame(d,columns=['Name','Age','Score'])

# get the minimum value of every column in the DataFrame - it will display Alex, 22, 31 (dtype: object)
print(df.min())

# get the minimum value of the column 'Age' - it will be 22
print(df['Age'].min())

# get the minimum value of the column 'Name' - it will be Alex
print(df['Name'].min())

Select row with maximum and minimum value in Pandas DataFrame

import pandas as pd
# Create a DataFrame
d = {
df = pd.DataFrame(d,columns=['Name','Age','Score'])

# get the row of max value

# get the row of minimum value
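
The calls that select the rows are missing above, and so is the data. One common way to do this is idxmax / idxmin combined with loc, sketched below. The sample data and the choice of the 'Score' column are assumptions, not taken from the original page.

```python
import pandas as pd

# Hypothetical sample data with the same column names as the listing above.
df = pd.DataFrame({
    "Name": ["Ann", "Bob", "Cid"],
    "Age": [30, 25, 41],
    "Score": [70, 95, 55],
})

# idxmax / idxmin return the index label of the extreme value;
# loc then fetches the whole row for that label.
row_max = df.loc[df["Score"].idxmax()]
row_min = df.loc[df["Score"].idxmin()]

print(row_max)
print(row_min)
```

If several rows share the extreme value, idxmax and idxmin return the first one.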

Get the unique values (rows) of a Pandas Dataframe

import pandas as pd
# Create a DataFrame
d = {
df = pd.DataFrame(d,columns=['Name','Age'])

# get the unique values (rows)
print(df.drop_duplicates())

# get the unique values (rows) by retaining last row
print(df.drop_duplicates(keep='last'))

Get the list of column headers or column name in a Pandas DataFrame

import pandas as pd
# Create a DataFrame
d = {
df = pd.DataFrame(d,columns=['Name','Age','Score'])

# method 1: get list of column names
print(list(df.columns.values))

# method 2: get list of column names
print(list(df))

Delete or Drop the duplicate row of a Pandas DataFrame

import pandas as pd
# Create a DataFrame
d = {
df = pd.DataFrame(d,columns=['Name','Age','Score'])

# drop duplicate rows
df.drop_duplicates()

# drop duplicate rows by retaining last occurrence
df.drop_duplicates(keep='last')

# drop duplicate by a column name
df.drop_duplicates(['Name'], keep='last')

Drop or delete the row in Pandas DataFrame with conditions

import pandas as pd
# Create a DataFrame
d = {
df = pd.DataFrame(d,columns=['Name','Age','Score'])

# Drop an observation or row

# Drop a row by condition
df[df.Name != 'Alisa']

# Drop a row by index

# Drop bottom 3 rows
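
Most of the calls in this listing are missing. The sketch below shows one plausible call for each comment; the sample data and the specific rows chosen are hypothetical. Each operation returns a new DataFrame and leaves df unchanged.

```python
import pandas as pd

# Hypothetical sample data with the same column names as the listing above.
df = pd.DataFrame({
    "Name": ["Ann", "Bob", "Cid", "Dee", "Eve"],
    "Age": [30, 25, 41, 36, 29],
    "Score": [70, 95, 55, 80, 62],
})

# Drop observations by index label.
dropped_labels = df.drop([1, 2])

# Drop rows by condition: keep everything except the matching rows.
filtered = df[df.Name != "Ann"]

# Drop a row by its position in the index.
dropped_position = df.drop(df.index[2])

# Drop the bottom 3 rows by slicing them off.
without_bottom3 = df.iloc[:-3]

print(dropped_labels)
print(without_bottom3)
```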

Reshape wide to long in Pandas DataFrame with melt() function

import pandas as pd
# Create a DataFrame
d = {
df = pd.DataFrame(d,columns=['countries','population_in_million','gdp_percapita'])

# shape from wide to long with melt function in pandas
df2=pd.melt(df,id_vars=['countries'],var_name='metrics', value_name='values')

Reshape long to wide in Pandas DataFrame with pivot function

import pandas as pd
# Create a DataFrame
d = {
df = pd.DataFrame(d,columns=['countries','metrics','values'])

# reshape from long to wide in pandas python
df2=df.pivot(index='countries', columns='metrics', values='values')

Reshape using stack() and unstack() functions in Pandas DataFrame

import pandas as pd
header = pd.MultiIndex.from_product([['Semester1','Semester2'],['Maths','Science']])
df = pd.DataFrame(d,

# stack the dataframe
stacked_df = df.stack()

# unstack the dataframe
unstacked_df = stacked_df.unstack()

# stack the dataframe of column at level 0
stacked_df_lvl = df.stack(level=0)

# unstack the dataframe
unstacked_df1 = stacked_df_lvl.unstack()


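Palindrome - Determine whether a word is a palindrome (pass str(n) to check an integer)

def is_palindrome(word):
    j = len(word)-1
    i = 0
    # compare characters from both ends, moving toward the middle
    while i<j and word[i].lower()==word[j].lower():
        i += 1
        j -= 1
    return (i>=j)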

Two sum -  return indices of the two numbers such that they add up to a specific target

Given nums = [2, 7, 11, 15], target = 9,

Because nums[0] + nums[1] = 2 + 7 = 9,
return [0, 1].

class Solution(object):
    def twoSum(self, nums, target):
        seen = {}
        for i, v in enumerate(nums):
            remaining = target - v
            if remaining in seen:
                return [seen[remaining], i]
            seen[v] = i
        return []

Reverse integer - Given a 32-bit signed integer, reverse digits of an integer.

Input: 123 → Output: 321
Input: -123 → Output: -321
Input: 120 → Output: 21

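class Solution(object):
    def reverse(self, x):
        # inputs outside the 32-bit signed range are rejected
        if x >= 2**31-1 or x <= -2**31:
            return 0
        else:
            strg = str(x)
            if x >= 0 :
                revst = strg[::-1]
            else:
                # reverse the digits only, then put the sign back
                temp = strg[1:]
                temp2 = temp[::-1]
                revst = "-" + temp2
            # the reversed value must also fit in 32 bits
            if int(revst) >= 2**31-1 or int(revst) <= -2**31:
                return 0
            else:
                return int(revst)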

Merge two sorted linked lists and return it as a new list. 

Input: 1->2->4, 1->3->4
Output: 1->1->2->3->4->4

# ListNode (with .val and .next attributes) is assumed to be defined elsewhere, as on LeetCode
class Solution:
	def mergeTwoLists(self, l1, l2):

		result = ListNode(0) # The new list we are going to eventually return
		head = result # keep a pointer to the head so we can return in the end
		while(l1 != None and l2 != None): # This check is important in the case where one list is shorter than the other
			if l1.val < l2.val: # Add l1's value as a new node to result if its less than l2's
				result.next = ListNode(l1.val)
				l1 = l1.next
				result = result.next
			elif l2.val < l1.val: # Add l2's value as a new node to result if its less than l1's
				result.next = ListNode(l2.val)
				l2 = l2.next
				result = result.next
			else: # In this case, the values must be equal so add both to result and move the linked lists forward
				result.next = ListNode(l1.val)
				result = result.next
				result.next = ListNode(l2.val)
				result = result.next
				l1 = l1.next
				l2 = l2.next

		if l1 == None and l2 != None: # If l2 is longer than l1, add all of the remaining values of l2 to result
			while(l2 != None):
				result.next = ListNode(l2.val)
				result = result.next
				l2 = l2.next
		elif l2 == None and l1 != None: # if l1 is longer than l2, add all of the remaining values of l1 to result
			while(l1 != None):
				result.next = ListNode(l1.val)
				result = result.next
				l1 = l1.next

		return head.next # return the result, skipping the placeholder first node

Remove Duplicates from Sorted Array

class Solution(object):
    def removeDuplicates(self, nums):
        if not nums:
            return 0
        i = 1
        bound = len(nums)
        prev = nums[0]
        while i < bound:
            if prev == nums[i]:
                # duplicate of the previous value: remove it in place
                nums.pop(i)
                bound = len(nums)
            else:
                prev = nums[i]
                i += 1
        return len(nums)