Bindata

A convenience Python library to average noisy data over bins. Entirely based on NumPy.

The original idea, and initial implementation, is from A.P. Petroff.

Quick example

Let us generate noisy data:

from pylab import *

x = linspace(0,1,500)  # 500 evenly spaced points on [0, 1]
y = sin( 10*x ) + ( rand( len( x ) ) - .5 )  # sine signal plus uniform noise in [-0.5, 0.5)

To average the noise out, we group the data into bins, and take the average of each bin.

from bindata import bindata
X, Y = bindata( x, y ).apply()

Here is the result:

plot( x, y, '.', alpha = .3)
plot( X, Y, 'o' )
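For readers who want to see what bin averaging amounts to, here is a minimal sketch in plain NumPy. It is not the library's implementation: the bin edges (10 equal-width bins spanning the data), the clipping of the last point, and every variable name are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(0, 1, 500)
y = np.sin(10 * x) + (rng.random(len(x)) - 0.5)

# Assumed for illustration: 10 equal-width bins spanning the data.
edges = np.linspace(x.min(), x.max(), 11)
nbins = len(edges) - 1

# Assign each point to a bin; clip so that x.max() lands in the last bin.
index = np.clip(np.digitize(x, edges) - 1, 0, nbins - 1)

# Per-bin sums divided by per-bin counts give the per-bin means.
counts = np.bincount(index, minlength=nbins)
X = np.bincount(index, weights=x, minlength=nbins) / counts
Y = np.bincount(index, weights=y, minlength=nbins) / counts
```

With 50 points per bin, the uniform noise is reduced by a factor of about 7 (the square root of 50), which is why the binned points follow the sine curve much more closely than the raw data.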

Choose the bins

By default, the data are grouped into 10 bins spaced linearly along the x axis. The nbins and bins arguments change this:

X, Y = bindata( x, y, nbins = 20, bins = 'log' ).apply()
X, Y = bindata( x, y, bins = 'equal_size' ).apply()

We can also set the bins by hand, by passing an array of bin boundaries:

bd = bindata( x, y, bins = linspace(0.2,.6,10))
X, Y = bd.apply()

plot( x, y, '.', alpha = .3)
plot( X, Y, 'o' )

for bin_boundary in bd.bins :
    axvline( bin_boundary, color = 'k', alpha = .1 )
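To make the different bin layouts concrete, the sketch below builds three sets of bin boundaries with NumPy alone. It does not show how bindata constructs its bins internally. In particular, reading 'equal_size' as "equal number of points per bin" is an assumption, and the strictly positive x range is chosen here only so that logarithmic spacing is defined.

```python
import numpy as np

# Strictly positive data, so that logarithmic spacing is well defined.
x = np.linspace(0.01, 1, 500)
nbins = 20

# Boundaries evenly spaced in x.
linear_edges = np.linspace(x.min(), x.max(), nbins + 1)

# Boundaries evenly spaced in log(x).
log_edges = np.geomspace(x.min(), x.max(), nbins + 1)

# Boundaries at quantiles of x, so each bin holds about the same number
# of points (an assumed interpretation of 'equal_size').
quantile_edges = np.quantile(x, np.linspace(0, 1, nbins + 1))
```

Logarithmic boundaries are useful when the data span several orders of magnitude. Quantile boundaries keep the statistical weight of each bin comparable when the points are unevenly distributed along x.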

Not just averages

The bindata object stores the data grouped by bin. Any statistic can then be computed for each bin by passing a function to apply:

b = bindata( x, y, nbins = 15 )
X, Y = b.apply( mean )
sigma_x, sigma_y = b.apply( std )

Here is the result:

plot( x, y, '.', alpha = .3)
errorbar( X, Y, sigma_y, sigma_x, 'o' )
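The idea of applying an arbitrary function to each bin can be sketched as follows. The helper apply_per_bin is hypothetical and is not part of bindata. The bin edges are likewise assumed for illustration. The last lines derive the standard error of the mean, which is often a more appropriate error bar for the bin average than the standard deviation.

```python
import numpy as np

def apply_per_bin(values, index, nbins, func=np.mean):
    """Apply func to the values of each bin (hypothetical helper)."""
    return np.array([func(values[index == i]) for i in range(nbins)])

rng = np.random.default_rng(1)
x = np.linspace(0, 1, 500)
y = np.sin(10 * x) + (rng.random(len(x)) - 0.5)

# Assumed for illustration: 15 equal-width bins spanning the data.
nbins = 15
edges = np.linspace(x.min(), x.max(), nbins + 1)
index = np.clip(np.digitize(x, edges) - 1, 0, nbins - 1)

# Any function that reduces an array to a number can be used.
Y = apply_per_bin(y, index, nbins, np.mean)
Y_median = apply_per_bin(y, index, nbins, np.median)
sigma_y = apply_per_bin(y, index, nbins, np.std)

# Standard error of the mean: the scatter divided by sqrt(bin population).
counts = np.bincount(index, minlength=nbins)
sem_y = sigma_y / np.sqrt(counts)
```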

Bin population

Some bins are populated, others are empty. The nb attribute lists the number of data points in each bin:

print( b.nb )

>>> [0, 36, 36, 35, 36, 36, 35, 36, 36, 35, 36, 36, 35, 36, 35, 1]

By default, empty bins produce np.nan values.

print( b.apply()[0] )

>>> [ nan 0.03507014 0.10721443 0.17835671 0.249499 0.32164329 0.39278557 0.46392786 0.53607214 0.60721443 0.67835671 0.750501 0.82164329 0.89278557 0.96392786 1. ]

To change this behavior:

b.apply( empty_as_nan = False )
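The sketch below illustrates how empty bins and NaN values can arise, using np.digitize and np.bincount. It rests on assumptions rather than on the library's source: the boundaries are taken to be 15 linearly spaced values on [0, 1], and points are assigned as np.digitize does, with one extra bin below the first boundary and one at or above the last. Under these assumptions, the first bin is empty and the last one holds the single point x = 1, as in the output above.

```python
import numpy as np

x = np.linspace(0, 1, 500)
edges = np.linspace(0, 1, 15)  # assumed boundaries, chosen to mimic the example

# np.digitize returns 0 below the first boundary and len(edges) at or
# above the last one, so there are len(edges) + 1 possible bins.
index = np.digitize(x, edges)
counts = np.bincount(index, minlength=len(edges) + 1)

def mean_or_nan(values):
    """Return the mean of values, or NaN when the bin is empty."""
    return np.mean(values) if len(values) > 0 else np.nan

X = np.array([mean_or_nan(x[index == i]) for i in range(len(edges) + 1)])

# Mask that keeps only the populated bins, e.g. before plotting or fitting.
populated = ~np.isnan(X)
```

Keeping NaN for empty bins preserves the one-to-one correspondence between bins and output values. The mask removes them when a downstream function cannot handle NaN.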
