NumPy: Copies and Views

Open In Colab

NumPy Quickstart

This notebook is a quick introduction to NumPy. It is an interactive version of the NumPy Quickstart Tutorial.

All credits go to the original authors of the tutorial © Copyright 2008-2023, NumPy Developers.

Copies and Views

When operating and manipulating arrays, their data is sometimes copied into a new array and sometimes not. This is often a source of confusion for beginners. There are three cases:

No Copy at All

Simple assignments make no copy of array objects or of their data.

import numpy as np

a = np.array([[0, 1, 2, 3],
              [4, 5, 6, 7],
              [8, 9, 10, 11]])

b = a # no new object is created
print(b is a) # a and b are two names for the same ndarray object
True

Python passes mutable objects as references, so function calls make no copy.

def f(x):
    print(id(x))

print(id(a)) # id is a unique identifier of an object

f(a) # a is passed to the function under the name x
139846954913200
139846954913200

View or Shallow Copy

Different array objects can share the same data. The view method creates a new array object that looks at the same data.

c = a.view() # c is a view of the data owned by a

print("c is a = {}".format(c is a))
print("c.base is a = {}".format(c.base is a)) # c is a view of the data owned by a
print("c.flags.owndata = {}".format(c.flags.owndata)) # c does not own the data

c = c.reshape((2, 6)) # a's shape doesn't change
print("a.shape = {}".format(a.shape))

c[0, 4] = 1234 # a's data changes
print("a =\n{}".format(a))
c is a = False
c.base is a = True
c.flags.owndata = False
a.shape = (3, 4)
a =
[[   0    1    2    3]
 [1234    5    6    7]
 [   8    9   10   11]]

Slicing an array returns a view of it:

s = a[:, 1:3]
s[:] = 10 # s[:] is a view of s. Note the difference between s=10 and s[:]=10

print("a =\n{}".format(a))
a =
[[   0   10   10    3]
 [1234   10   10    7]
 [   8   10   10   11]]

Deep Copy

The copy method makes a complete copy of the array and its data.

d = a.copy() # a new array object with new data is created

print("d is a = {}".format(d is a))
print("d.base is a = {}".format(d.base is a)) # d doesn't share anything with a

d[0, 0] = 9999

print("a =\n{}".format(a))
d is a = False
d.base is a = False
a =
[[   0   10   10    3]
 [1234   10   10    7]
 [   8   10   10   11]]

Sometimes copy should be called after slicing if the original array is not required anymore. For example, suppose a is a huge intermediate result and the final result b only contains a small fraction of a, a deep copy should be made when constructing b with slicing:

a = np.arange(int(1e8))
b = a[:100].copy()

del a # the memory of ``a`` can be released.

If b = a[:100] is used instead, a is referenced by b and will persist in memory even if del a is executed.

Functions and Methods Overview

See Routines for the full list of routines available in NumPy.