Collections module 2023 - CodingCompiler

The built-in collections module provides several specialized, high-performance container types that serve as alternatives to the general-purpose dict, list, tuple and set. Abstract base classes describing different kinds of collection functionality (such as MutableSet and ItemsView) live in the companion collections.abc module in Python 3.

Collections module: collections.Counter

Counter is a dict subclass that lets you count hashable objects easily. It has utility methods for working with the frequencies of the objects you are counting.

import collections
counts = collections.Counter([1,2,3])

The above code creates an object, counts, which holds the frequencies of all the elements passed to the constructor. In this example it has the value Counter({1: 1, 2: 1, 3: 1}).

Constructor examples

Letter Counter

collections.Counter('Happy Birthday')
Counter({'a': 2, 'p': 2, 'y': 2, 'i': 1, 'r': 1, 'B': 1, ' ': 1, 'H': 1, 'd': 1, 'h': 1, 't': 1})

Word Counter

collections.Counter('I am Sam Sam I am That Sam-I-am That Sam-I-am! I do not like that Sam-I-am'.split())
Counter({'I': 3, 'Sam': 2, 'Sam-I-am': 2, 'That': 2, 'am': 2, 'do': 1, 'Sam-I-am!': 1, 'that': 1,
'not': 1, 'like': 1})

Recipes

c = collections.Counter({'a': 4, 'b': 2, 'c': -2, 'd': 0})

Get count of individual element

c['a']
4

Set count of individual element

c['c'] = -3
c
Counter({'a': 4, 'b': 2, 'd': 0, 'c': -3})

Get total number of elements in counter (4 + 2 + 0 - 3)

sum(c.values()) # negative numbers are counted!
3

Get elements (only those with positive counter are kept)

list(c.elements())
['a', 'a', 'a', 'a', 'b', 'b']

Remove keys with 0 or negative value

c - collections.Counter()
Counter({'a': 4, 'b': 2})

Remove everything

c.clear()
c
Counter()

Add and remove individual elements

c.update({'a': 3, 'b':3})
c.update({'a': 2, 'c':2}) # adds to existing, sets if they don't exist
c
Counter({'a': 5, 'b': 3, 'c': 2})
c.subtract({'a': 3, 'b': 3, 'c': 3}) # subtracts (negative values are allowed)
c
Counter({'a': 2, 'b': 0, 'c': -1})
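
As a complement to the recipes above, here is a minimal sketch of most_common(), one of Counter's utility methods for working with frequencies, together with Counter addition. The sample sentence and the variable names are illustrative and not part of the original examples.

```python
import collections

# Illustrative input; any iterable of hashable items works.
words = 'the cat and the hat and the bat'.split()
word_counts = collections.Counter(words)

# most_common(n) returns the n highest counts as (element, count) pairs.
top_two = word_counts.most_common(2)
print(top_two)  # [('the', 3), ('and', 2)]

# Adding two counters sums the counts element by element.
merged = collections.Counter(a=1, b=2) + collections.Counter(a=3)
print(merged)  # Counter({'a': 4, 'b': 2})
```

Note that the arithmetic operators keep only positive counts in the result, which is why subtracting an empty Counter removes zero and negative entries.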

Collections module: collections.OrderedDict

Before Python 3.7, the order of keys in a plain Python dictionary was arbitrary: it was not governed by the order in which you added them. (Since Python 3.7, dict preserves insertion order as a language guarantee.)

For example, on an older interpreter:

d = {'foo': 5, 'bar': 6}
print(d)
{'foo': 5, 'bar': 6}
d['baz'] = 7
print(d)
{'baz': 7, 'foo': 5, 'bar': 6}
d['foobar'] = 8
print(d)
{'baz': 7, 'foo': 5, 'bar': 6, 'foobar': 8}

(Because the ordering is arbitrary on those versions, you may get different results from those shown here. On Python 3.7 and later, the keys print in insertion order.)

The order in which the keys appear is the order in which they would be iterated over, e.g. using a for loop.

The collections.OrderedDict class provides dictionary objects that retain the order of keys. OrderedDicts can be created as shown below with a series of ordered items (here, a list of tuple key-value pairs):

from collections import OrderedDict
d = OrderedDict([('foo', 5), ('bar', 6)])
print(d)
OrderedDict([('foo', 5), ('bar', 6)])
d['baz'] = 7
print(d)
OrderedDict([('foo', 5), ('bar', 6), ('baz', 7)])

d['foobar'] = 8

print(d)
OrderedDict([('foo', 5), ('bar', 6), ('baz', 7), ('foobar', 8)])

Or we can create an empty OrderedDict and then add items:

o = OrderedDict()
o['key1'] = "value1"
o['key2'] = "value2"
print(o)
OrderedDict([('key1', 'value1'), ('key2', 'value2')])

Iterating through an OrderedDict allows key access in the order they were added.

What happens if we assign a new value to an existing key?

d['foo'] = 4
print(d)
OrderedDict([('foo', 4), ('bar', 6), ('baz', 7), ('foobar', 8)])

The key retains its original place in the OrderedDict.
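
If you do want to change a key's position, OrderedDict offers move_to_end() (available since Python 3.2). The following is a small sketch of that behaviour, reusing the key names from the examples above.

```python
from collections import OrderedDict

d = OrderedDict([('foo', 5), ('bar', 6), ('baz', 7)])

# Reassigning an existing key keeps its position...
d['foo'] = 4
print(list(d))  # ['foo', 'bar', 'baz']

# ...while move_to_end() relocates it explicitly.
d.move_to_end('foo')
print(list(d))  # ['bar', 'baz', 'foo']

# last=False moves the key to the front instead.
d.move_to_end('baz', last=False)
print(list(d))  # ['baz', 'bar', 'foo']
```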

Collections module: collections.defaultdict

collections.defaultdict(default_factory) returns a subclass of dict that supplies a default value for missing keys. The argument should be a callable that returns the default value when called with no arguments. If nothing is passed, default_factory is None and a missing key raises KeyError, just as in a normal dict.

state_capitals = collections.defaultdict(str)
state_capitals
defaultdict(<class 'str'>, {})

This returns a reference to a defaultdict that creates an empty string for each missing key by calling its default_factory.

A typical usage of defaultdict is to use one of the builtin types such as str, int, list or dict as the default_factory, since these return empty types when called with no arguments:

str()
''
int()
0
list()
[]

Looking up a key that does not exist in a defaultdict does not produce an error as it would in a normal dictionary.

state_capitals['Alaska']
''
state_capitals
defaultdict(<class 'str'>, {'Alaska': ''})

Another example with int:

fruit_counts = collections.defaultdict(int)
fruit_counts['apple'] += 2 # No errors should occur
fruit_counts
defaultdict(<class 'int'>, {'apple': 2})
fruit_counts['banana'] # No errors should occur
0
fruit_counts # A new key is created
defaultdict(<class 'int'>, {'apple': 2, 'banana': 0})

Normal dictionary methods work with the default dictionary

state_capitals['Alabama'] = 'Montgomery'
state_capitals
defaultdict(<class 'str'>, {'Alaska': '', 'Alabama': 'Montgomery'})

Using list as the default_factory will create a list for each new key.

s = [('NC', 'Raleigh'), ('VA', 'Richmond'), ('WA', 'Seattle'), ('NC', 'Asheville')]
dd = collections.defaultdict(list)
for k, v in s:
    dd[k].append(v)
dd
defaultdict(<class 'list'>,
{'NC': ['Raleigh', 'Asheville'],
'VA': ['Richmond'],
'WA': ['Seattle']})
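
The factory does not have to be a built-in type: any callable that takes no arguments will do. The sketch below uses a lambda returning the placeholder string 'unknown' (an illustrative choice, not from the original) and also shows that get() and the in operator do not trigger the factory.

```python
import collections

# Hypothetical default: any zero-argument callable can serve as the factory.
state_capitals = collections.defaultdict(lambda: 'unknown')
state_capitals['Alabama'] = 'Montgomery'

print(state_capitals['Alabama'])  # Montgomery
print(state_capitals['Alaska'])   # unknown (the key is created on access)

# get() and "in" do not call the factory, so they add no keys.
print(state_capitals.get('Texas'))  # None
print('Texas' in state_capitals)    # False
print(sorted(state_capitals))       # ['Alabama', 'Alaska']
```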

Collections module: collections.namedtuple

Define a new type Person using namedtuple like this:

from collections import namedtuple
Person = namedtuple('Person', ['age', 'height', 'name'])

The second argument is the list of attributes that the tuple will have. You can also give these attributes as a single space- or comma-separated string:

Person = namedtuple('Person', 'age, height, name')

Person = namedtuple('Person', 'age height name')

Once defined, a named tuple can be instantiated by calling the object with the necessary parameters, e.g.:

dave = Person(30, 178, 'Dave')

Named arguments can also be used:

jack = Person(age=30, height=178, name='Jack S.')

Now you can access the attributes of the namedtuple:

print(jack.age) # 30
print(jack.name) # 'Jack S.'

The first argument to the namedtuple constructor (in our example 'Person') is the typename. It is typical to use the same word for the constructor and the typename, but they can be different:

Human = namedtuple('Person', 'age, height, name')
dave = Human(30, 178, 'Dave')

print(dave) # yields: Person(age=30, height=178, name='Dave')
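
Because a named tuple is still a tuple, it supports indexing and unpacking, and its fields are read-only. The sketch below illustrates this along with the helper methods _replace() and _asdict() and the _fields attribute; the name older_dave is illustrative.

```python
from collections import namedtuple

Person = namedtuple('Person', ['age', 'height', 'name'])
dave = Person(30, 178, 'Dave')

# Named tuples are still tuples: indexing and unpacking work.
age, height, name = dave
print(dave[0], name)  # 30 Dave

# Fields are read-only; _replace() returns a modified copy.
older_dave = dave._replace(age=31)
print(older_dave)  # Person(age=31, height=178, name='Dave')

# _asdict() converts the instance to a mapping of field names to values.
print(dave._asdict()['height'])  # 178
print(Person._fields)  # ('age', 'height', 'name')
```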

collections.deque

Returns a new deque object initialized left-to-right (using append()) with data from iterable. If iterable is not specified, the new deque is empty.

Deques are a generalization of stacks and queues (the name is pronounced "deck" and is short for "double-ended queue"). Deques support thread-safe, memory efficient appends and pops from either side of the deque with approximately the same O(1) performance in either direction.

Though list objects support similar operations, they are optimized for fast fixed-length operations and incur O(n) memory movement costs for pop(0) and insert(0, v) operations which change both the size and position of the underlying data representation.

New in version 2.4.

If maxlen is not specified or is None, deques may grow to an arbitrary length. Otherwise, the deque is bounded to the specified maximum length. Once a bounded length deque is full, when new items are added, a corresponding number of items are discarded from the opposite end. Bounded length deques provide functionality similar to the tail filter in Unix. They are also useful for tracking transactions and other pools of data where only the most recent activity is of interest.

Changed in version 2.6: Added maxlen parameter.
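
The bounded behaviour described above can be sketched as follows. The name recent and the numbers are illustrative; the point is that a full deque discards items from the end opposite to the one being appended to.

```python
from collections import deque

# Keep only the three most recent items (the values are illustrative).
recent = deque(maxlen=3)
for n in range(1, 6):
    recent.append(n)  # once full, the oldest item drops off the left
print(recent)  # deque([3, 4, 5], maxlen=3)

# Appending on the left discards from the right instead.
recent.appendleft(0)
print(recent)  # deque([0, 3, 4], maxlen=3)
```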

>>> from collections import deque
>>> d = deque('ghi') # make a new deque with three items
>>> for elem in d: # iterate over the deque's elements
...     print(elem.upper())
G
H
I
>>> d.append('j') # add a new entry to the right side
>>> d.appendleft('f') # add a new entry to the left side
>>> d # show the representation of the deque
deque(['f', 'g', 'h', 'i', 'j'])
>>> d.pop() # return and remove the rightmost item
'j'
>>> d.popleft() # return and remove the leftmost item
'f'
>>> list(d) # list the contents of the deque
['g', 'h', 'i']
>>> d[0] # peek at leftmost item
'g'
>>> d[-1] # peek at rightmost item
'i'
>>> list(reversed(d)) # list the contents of a deque in reverse
['i', 'h', 'g']
>>> 'h' in d # search the deque
True
>>> d.extend('jkl') # add multiple elements at once
>>> d
deque(['g', 'h', 'i', 'j', 'k', 'l'])
>>> d.rotate(1) # right rotation
>>> d
deque(['l', 'g', 'h', 'i', 'j', 'k'])
>>> d.rotate(-1) # left rotation
>>> d
deque(['g', 'h', 'i', 'j', 'k', 'l'])
>>> deque(reversed(d)) # make a new deque in reverse order
deque(['l', 'k', 'j', 'i', 'h', 'g'])
>>> d.clear() # empty the deque
>>> d.pop() # cannot pop from an empty deque
Traceback (most recent call last):
  File "<pyshell#6>", line 1, in -toplevel-
    d.pop()
IndexError: pop from an empty deque
>>> d.extendleft('abc') # extendleft() reverses the input order
>>> d
deque(['c', 'b', 'a'])

Source: https://docs.python.org/2/library/collections.html

collections.ChainMap

ChainMap is new in version 3.3

Returns a new ChainMap object given a number of maps. This object groups multiple dicts or other mappings together to create a single, updateable view.

ChainMaps are useful for managing nested contexts and overlays. An example in the Python world is found in the implementation of the Context class in Django's template engine. A ChainMap is useful for quickly linking a number of mappings so that the result can be treated as a single unit. It is often much faster than creating a new dictionary and running multiple update() calls.

Anytime one has a chain of lookup values there can be a case for ChainMap. An example includes having both user specified values and a dictionary of default values. Another example is the POST and GET parameter maps found in web use, e.g. Django or Flask. Through the use of ChainMap one returns a combined view of two distinct dictionaries.

The maps parameter list is ordered from first-searched to last-searched. Lookups search the underlying mappings successively until a key is found. In contrast, writes, updates, and deletions only operate on the first mapping.

import collections

# Define two dictionaries with at least some keys overlapping.
dict1 = {'apple': 1, 'banana': 2}
dict2 = {'coconut': 1, 'date': 1, 'apple': 3}

# Create two ChainMaps with different ordering of those dicts.
combined_dict = collections.ChainMap(dict1, dict2)
reverse_ordered_dict = collections.ChainMap(dict2, dict1)

Note the impact of order on which value is found first in the subsequent lookup (the key order shown is that of Python 3.7 and later; older versions may list the keys in a different order):

for k, v in combined_dict.items():
    print(k, v)
coconut 1
date 1
apple 1
banana 2
for k, v in reverse_ordered_dict.items():
    print(k, v)
apple 3
banana 2
coconut 1
date 1
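
The rule that lookups search every mapping while writes and deletions only touch the first one can be sketched with the defaults-plus-overrides use case mentioned earlier. The dictionaries defaults and user_settings and their keys are hypothetical, chosen only for illustration.

```python
import collections

# Hypothetical settings: user values should shadow the defaults.
defaults = {'theme': 'light', 'font_size': 12}
user_settings = {'theme': 'dark'}
settings = collections.ChainMap(user_settings, defaults)

print(settings['theme'])      # dark (found in the first mapping)
print(settings['font_size'])  # 12 (falls through to defaults)

# Writes and deletions touch only the first mapping.
settings['font_size'] = 14
print(user_settings)          # {'theme': 'dark', 'font_size': 14}
print(defaults['font_size'])  # 12

del settings['theme']
print(settings['theme'])      # light
```

Since the ChainMap is a view rather than a copy, later changes to either underlying dictionary are visible through it immediately.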
