Links: science communication, statistics, large data analysis

A long overdue set of links and quick thoughts:

Communicating research


Data analysis

  • Revolutions has advice for dealing with large data sets from the 2010 Workshop on Algorithms for Modern Massive Data Sets.

  • The larry package for manipulating tables in Python. This uses NumPy under the covers and is similar to dealing with data.frames in R.

  • Will describes how callbacks can drive an analysis pipeline. As analysis workflows get more complicated, your code can get to be a mess of special cases and become really fragile. Here he passes around functions through a standard runner to help generalize and abstract the process.




Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )


Connecting to %s