Our World in Data recently announced that they are providing APIs to access their data. Coincidentally, I am using one of their datasets in my workshop on time series analysis at PyData Global 2024. So I took this opportunity to update my example using the new API – this notebook shows what I learned. Click here to run this notebook on Colab. It is based on Chapter 12 of Think Stats, third edition. import numpy as np import pandas as... The post Download the World in Data appeared first on Probably Overthinking It.
7 months ago

More from Probably Overthinking It

Announcing Think Linear Algebra

I’ve been thinking about Think Linear Algebra for more than a decade, and recently I started working on it in earnest. If you want to get a sense of it, I’ve posted a draft chapter as a Jupyter notebook. In one way, I am glad I waited: I think it will be better, faster [to write], and stronger [?] because of AI tools. To be clear, I am writing this book, not AI. But I’m finding ChatGPT helpful for...

a month ago 20 votes

My very busy week

I’m not sure who scheduled ODSC and PyConUS during the same week, but I am unhappy with their decisions. Last Tuesday I presented a talk and co-presented a workshop at ODSC, and on Thursday I presented a tutorial at PyCon. If you would like to follow along with my very busy week, here are the resources: Practical Bayesian Modeling with PyMC Co-presented with Alex Fengler for ODSC East 2025 In this tutorial, we explore Bayesian regression using PyMC – the...

a month ago 21 votes

Announcing Think Stats 3e

The third edition of Think Stats is on its way to the printer! You can preorder now from Bookshop.org and Amazon (those are affiliate links), or if you can’t wait to get a paper copy, you can read the free, online version here. Here’s the new cover, still featuring a suspicious-looking archerfish. If you are not familiar with the previous editions, Think Stats is an introduction to practical methods for exploring and visualizing data, discovering relationships and trends, and communicating...

3 months ago 28 votes

Young Adults Want Fewer Children

The most recent data from the National Survey of Family Growth (NSFG) provides a first look at people born in the 2000s as young adults and an updated view of people born in the 1990s at the peak of their child-bearing years. Compared to previous generations at the same ages, these cohorts have fewer children, and they are less likely to say they intend to have children. Unless their plans change, trends toward lower fertility are likely to continue for...

3 months ago 25 votes

Algorithmic Fairness

This is the last in a series of excerpts from Elements of Data Science, now available from Lulu.com and online booksellers. This article is based on the Recidivism Case Study, which is about algorithmic fairness. The goal of the case study is to explain the statistical arguments presented in two articles from 2016: Both are about COMPAS, a statistical tool used in the justice system to assign defendants a “risk score” that is intended to reflect the risk that they...

5 months ago 18 votes

More in science

In a First, Solar Was Europe's Biggest Source of Power Last Month

For the first time, solar was the largest source of electricity in the EU last month, supplying a record 22 percent of the bloc's power. Read more on E360 →

18 hours ago 2 votes

The Biggest-Ever Digital Camera Is This Cosmologist’s Magnum Opus

Tony Tyson’s cameras revealed the universe’s dark contents. Now, with the Rubin Observatory’s 3.2-billion-pixel camera, he’s ready to study dark matter and dark energy in unprecedented detail. The post The Biggest-Ever Digital Camera Is This Cosmologist’s Magnum Opus first appeared on Quanta Magazine

16 hours ago 2 votes

US science funding - now time to push on the House appropriators

Some not-actively-discouraging news out of Washington DC yesterday: The Senate appropriations committee is doing its markups of the various funding bills (which all technically originated in the House), and it appears that they have pushed to keep the funding for NASA and NSF (which are bundled in the same bill with the Department of Justice for no obvious reason) at FY24 levels. See here as well. This is not yet a done deal within the Senate, but it's better than many alternatives. If you are a US citizen or permanent resident and one of your senators is on the appropriations committee, please consider calling them to reinforce how devastating massive budget cuts to these agencies would be. I am told that feedback to any other senators is also valuable, but appropriators are particularly important here.

The House appropriations committee has not yet met to mark up their versions. They had been scheduled to do so earlier this week but punted it for an unknown time. Their relevant subcommittee membership is here. Again, if you are a constituent of one of these representatives, your calls would be particularly important, though it doesn't hurt for anyone to make their views heard to their representative. If the House version aligns with the presidential budget request, then a compromise between the two might still lead to 30% cuts to NSF and NASA, which would (IMO) still be catastrophic for the agencies and US science and competitiveness.

This is a marathon, not a sprint. There are still many looming difficulties - staffing cuts are well underway. Spending of already appropriated funds at agencies like NSF is way down, leading to the possibility that the executive branch may just order (or not-order-but-effectively-order) agencies not to spend and then claw back the funds. This year and in future years they could decide to underspend appropriations knowing that any legal resistance will take years and cost a fortune to work its way through the courts. This appropriations battle is also an annual affair - even if the cuts are forestalled for now (it is unlikely that the executive would veto all the spending bills over science agency cuts), this would have to happen again next year, and so on.

Still, right now, there is an opportunity to push against funding cuts. Failing to try would be a surrender.

(Obligatory notice: yes, I know that there are large-scale budgetary challenges facing the US; I don't think destroying government investment in science and engineering research is an intelligent set of spending cuts.)

17 hours ago 2 votes