Category Archives: Data

'under the surface'

I find this post on data visualization insightful, partly because it connects deeply to how I think about the world, and partly because I had never (consciously) noticed the ‘arrow’ in the FedEx logo before.

Filed away for future use: Facebook's findings on user use/retention

From the Dataspora Blog, ostensibly on the use and abuse of R, comes this gem about Facebook: "Itamar Rosenn, Facebook: Itamar conveyed how Facebook’s Data Team used R in 2007 to answer two questions about new users: (i) which data points predict whether a user will stay? and (ii) if they stay, which data points

Two models for understanding what people like, which is better?

I have a sort of interesting question, though perhaps it’s less interesting than I imagine it to be. Given the following two scenarios, which is more likely and why? Scenario 1: In the first case, we have a series of attributes attached to a person, and then we can make arguments (empirical, theoretical) about how

Types of variables, drop-down menus

Over at 37signals, they have a regular series detailing their design decisions. It is an insightful feature and an insightful blog. Their latest discussion is about how they managed a question on their support forms. I want to drop some research methodology on this problem. While their discussion is about how to design a

What is XBRL, and who does XBRL help?

Put it on your radar screens: the next big thing is going to be XBRL. It stands for eXtensible Business Reporting Language, and it is meant to make business reporting commensurable through standardization. So instead of entering text into an annual report, companies, governments, NGOs, anyone who would like to comply with a governmental mandate will be