Pooled Testing for Viruses: How many tests can it save?

By Max Candocia | November 26, 2020

Pooled testing is a method of increasing the efficiency of medical tests, such as COVID-19 detection tests. How efficient is it, though? That largely depends on what percentage of the population is infected.
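As a rough illustration of why the infection rate matters, here is a minimal sketch of classic two-stage pooling: test a pool once, then retest each member individually only if the pool is positive. It assumes perfectly accurate tests and independent infections, and the function names are hypothetical; the article itself may use a different scheme.

```python
def expected_tests_per_person(prevalence: float, pool_size: int) -> float:
    """Expected number of tests per person under two-stage pooling.

    Assumes a perfectly accurate test and independent infections.
    """
    if pool_size == 1:
        # A pool of one is just individual testing.
        return 1.0
    # Probability that nobody in the pool is infected.
    p_pool_negative = (1.0 - prevalence) ** pool_size
    # One shared pool test, plus individual retests if the pool is positive.
    return 1.0 / pool_size + (1.0 - p_pool_negative)


def best_pool_size(prevalence: float, max_pool_size: int = 50) -> int:
    """Pool size (1..max_pool_size) that minimizes expected tests per person."""
    return min(
        range(1, max_pool_size + 1),
        key=lambda k: expected_tests_per_person(prevalence, k),
    )


if __name__ == "__main__":
    for p in (0.001, 0.01, 0.05, 0.2, 0.5):
        k = best_pool_size(p)
        print(p, k, round(expected_tests_per_person(p, k), 3))
```

Under these assumptions, the savings shrink as prevalence rises, and once a large enough share of the population is infected, pooling is no better than testing everyone individually.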


Candy Combinations for Bundling

By Max Candocia | November 13, 2020

What candies would work best in a bundle? Using rankings and correlations, popular candies can be grouped together for optimal combinations.


How to Get Survey Responses from Reddit

By Max Candocia | November 12, 2020

If you need more data for a survey, you can use Reddit as a source of responses. In this article, we look at a few factors that affect the success of a survey posted to Reddit.


How Would the US Vote for a Candy?

By Max Candocia | October 28, 2020

What would it look like if people across the US voted for a candy? Explore different results using different voting methods and different types of representation, such as a national vote versus the Electoral College.


Hashing Data to Memorable Phrases

By Max Candocia | October 02, 2020

Do you have trouble memorizing long strings, but want to keep things easy to remember? Look no further than the new keyToEnglish package in R, now available on CRAN.


Calculating Similarity of Running Routes

By Max Candocia | September 13, 2020

When working with path-like data, such as a run recorded by GPS, you may want to group near-identical routes together. Using a small amount of data, I demonstrate how similarities can be calculated to find duplicate runs, as well as to make general comparisons between runs.


Dealing with Zeros and Negative Values with a Log Scale

By Max Candocia | August 30, 2020

When plotting data, you may want to use a log scale for most of your data, but zeros, near-zero values, and negative values make a pure log scale impossible. With a piecewise function that is linear near zero and logarithmic elsewhere, however, a log-like scale can still be achieved.
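One common way to build such a piecewise transform is a "symmetric log": linear inside a small threshold around zero, logarithmic outside it, and mirrored for negative values. The sketch below is an assumption about the general idea rather than the article's exact function; `symlog` and `linthresh` are hypothetical names.

```python
import math


def symlog(x: float, linthresh: float = 1.0) -> float:
    """Piecewise linear/log transform that accepts zero and negative values.

    Linear for |x| <= linthresh, base-10 logarithmic beyond it.
    The two pieces meet at |x| == linthresh, so the transform is continuous.
    """
    if abs(x) <= linthresh:
        # Linear region: maps [-linthresh, linthresh] onto [-1, 1].
        return x / linthresh
    # Log region: each factor of 10 beyond the threshold adds 1,
    # and copysign keeps the sign of the original value.
    return math.copysign(1.0 + math.log10(abs(x) / linthresh), x)


if __name__ == "__main__":
    for value in (-1000, -10, -0.5, 0, 0.5, 10, 1000):
        print(value, symlog(value))
```

The threshold controls how much of the axis is devoted to values near zero; a smaller `linthresh` leaves more room for the logarithmic part.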


Visualizing Direction in Running Routes

By Max Candocia | May 17, 2020

A relatively straightforward method of visualizing the direction of a running path using R and ggmap. This also works for any sort of path data in general.


How Likely Are You to be Banned From Reddit?

By Max Candocia | April 08, 2020

How Likely Are You to be Banned From Reddit? I got a bot for that.


Outliers in a Triathlon

By Max Candocia | February 26, 2020

How do you identify an "outlier" in a triathlon?


Hiding Data in Images

By Max Candocia | February 23, 2020

Images are one of the most common types of data that people view on the internet, but could they be hiding more than the eye can see?


Visualizing My Runs in 2019 (and 2018)

By Max Candocia | January 20, 2020

A visualization of my runs in 2019 using R, with buttons that allow comparisons with my runs in 2018.


Hong Kong on Reddit

By Max Candocia | September 22, 2019

The recent Hong Kong protests have garnered much media attention. Reddit, one of the largest social media communities, has its own communities with different takes on the protests. Here, we take a look at four notable ones.


Pacing at the Boston Marathon

By Max Candocia | May 04, 2019

Insights and animated charts of the paces and splits of Boston Marathon runners.


Puns, Ranked from Best to Worst

By Max Candocia | April 01, 2019

After 508 individuals each rated some puns, I sorted them by average score. How far down the list can you go?


Modeling Heart Rate Recovery with Nonlinear Regression

By Max Candocia | January 09, 2019

Nonlinear regression models can succeed where linear models fail and highly complex models cannot be interpreted. Using heart rate data I collected from my runs, I demonstrate how my heart rate recovers after stopping as a function of temperature and rest time.


What is my Lottery Ticket Actually Worth?

By Max Candocia | December 09, 2018

When you buy a lottery ticket, how much is it worth to you? Is the giant jackpot the main draw, or do you find the other prizes alluring? Using expected utility, we can see that tickets are worth much less than you might think.


Evaluate My Lottery Ticket!

By Max Candocia | December 07, 2018

What is the expected value of a lottery ticket, and is it actually worth it just for the jackpot? With this tool, you can look at what any number of tickets is worth, using highly customizable inputs.


What Are The Chances That Your Vote Makes a Difference?

By Max Candocia | November 12, 2018

What is the chance that your vote, or even all of your friends' votes, will matter in an election? This small application can figure that out for you, accompanied by some graphs and explanations that should make it easier to understand.


What Age do Kids Start Going Trick-or-Treating, and When do They Stop?

By Max Candocia | October 22, 2018

When do kids start going trick-or-treating, and when are they "too cool" to continue? Using survey data and a variety of statistical techniques, these questions can be answered with a certain level of statistical confidence.