larray.LArray.apply

LArray.apply(self, transform, *args, **kwargs)[source]

Apply a transformation function to array elements.

Parameters
transformfunction

Function to apply. This function will be called in turn with each element of the array as the first argument and must return an LArray, scalar or tuple. If returning arrays the axes of those arrays must be the same for all calls to the function.

*args

Extra arguments to pass to the function.

bystr, int or Axis or tuple/list/AxisCollection of the them, optional

Axis or axes along which to iterate. The function will thus be called with arrays having all axes not mentioned. Defaults to None (all axes). Mutually exclusive with the axes argument.

axesstr, int or Axis or tuple/list/AxisCollection of the them, optional

Axis or axes the arrays passed to the function will have. Defaults to None (the function is given scalars). Mutually exclusive with the by argument.

dtypetype or list of types, optional

Output(s) data type(s). Defaults to None (inspect all output values to infer it automatically).

ascendingbool, optional

Whether or not to iterate the axes in ascending order (from start to end). Defaults to True.

**kwargs

Extra keyword arguments are passed to the function (as keyword arguments).

Returns
LArray or scalar, or tuple of them

Axes will be the union of those in axis and those of values returned by the function.

Examples

First let us define a test array

>>> arr = LArray([[0, 2, 1],
...               [3, 1, 5]], 'a=a0,a1;b=b0..b2')
>>> arr
a\b  b0  b1  b2
 a0   0   2   1
 a1   3   1   5

Here is a simple function we would like to apply to each element of the array. Note that this particular example should rather be written as: arr ** 2 as it is both more concise and much faster.

>>> def square(x):
...     return x ** 2
>>> arr.apply(square)
a\b  b0  b1  b2
 a0   0   4   1
 a1   9   1  25

Functions can also be applied along some axes:

>>> # this is equivalent to (but much slower than): arr.sum('a')
... arr.apply(sum, axes='a')
b  b0  b1  b2
    3   3   6
>>> # this is equivalent to (but much slower than): arr.sum_by('a')
... arr.apply(sum, by='a')
a  a0  a1
    3   9

Applying the function along some axes will return an array with the union of those axes and the axes of the returned values. For example, let us define a function which returns the k highest values of an array.

>>> def topk(a, k=2):
...     return a.sort_values(ascending=False).ignore_labels().i[:k]
>>> arr.apply(topk, by='a')
a\b*  0  1
  a0  2  1
  a1  5  3

Other arguments can be passed to the function:

>>> arr.apply(topk, 3, by='a')
a\b*  0  1  2
  a0  2  1  0
  a1  5  3  1

or by using keyword arguments:

>>> arr.apply(topk, by='a', k=3)
a\b*  0  1  2
  a0  2  1  0
  a1  5  3  1

If the function returns several values (as a tuple), the result will be a tuple of arrays. For example, let use define a function which decompose an array in its mean and the difference to that mean :

>>> def mean_decompose(a):
...     mean = a.mean()
...     return mean, a - mean
>>> mean_by_a, diff_to_mean = arr.apply(mean_decompose, by='a')
>>> mean_by_a
a   a0   a1
   1.0  3.0
>>> diff_to_mean
a\b    b0    b1   b2
 a0  -1.0   1.0  0.0
 a1   0.0  -2.0  2.0