The Statisticator

Sunday, January 26, 2025

The Capitalistic Whirlpool (or "why there is mathematically no hope the players will vote to stop playing Squid Game")

Squid Games season 2 was a highly anticipated new season, and it certainly did not disappoint! Before I write another sentence, please be mindful some very high level spoilers will here be revealed, rest assured, nothing along the lines of “Player XYZ dies in episode 4”, but we will be digging into some of the rules of the game, so if you’re planning on watching Squid Games some day and want to be 100% surprised by it, please don’t read any further!

As a quick recap and to accurately define our variables, the high level rules in Squid Game are as follows:

456 players are in the overall game
The game is split into individual challenges
Every time a player dies in a challenge, 100 million wons are added to a common piggy bank ( from here on out we will refer to these 100 million wons as ‘player value’)
After each challenge, players can vote on whether to continue for another challenge, or stop there and split the money accumulated in the piggy bank equally amongst the survivors

For example, if 10 players die in the first challenge, then 1 billion wons (10 times 100 million wons) will be added to the piggy bank, and if the remaining 446 players (456 original players minus the 10 dead) decide to stop after that first challenge, each will leave with 100 million * 10 / 446 = 2.24 million wons.

Note 1: it is sometimes difficult to grasp these very large amounts in Korean wons. 1 won being equal to just 0.00069 dollars, each player’s value of 100 million wons is equivalent to about $69,000.

Which brings us to today’s first question: what does the curve for payoff-per-survivor look like as a function of the number of players having died so far?

We’ve seen what one data point on that curve looks like for 10 deaths, but we can easily generalize:

Average survivor payoff = (money in piggy bank) / (number of survivors)

= $100 millions * number of deaths / (456 - number of deaths)

And the curve looks something like (y-axis in billion wons):

We can zoom in on the first 0-400 deaths (y axis in million wons):

That’s quite the hockey stick shape if I ever saw one! But it shouldn’t come as too much of a surprise as both numerator and denominator favor many deaths: as the number of deaths increases, so does the piggy bank amount, but the number of survivor decreases, the hen taking the ratio we have something that really shoots up with a maximum of 45.5 billion wons!

Note 2: In the series, it is often said the single winner leaves with 45.6 billion wons, I’m guessing the organizers of the game might throw in an extra 100 million wons to the jackpot which corresponds to the ”value” of the player.

Note 3: Back to our conversions, the total jackpot of 45.6 billion wons is equivalent to approximately $31.5 million.

Now for a trickier question: in the series, the first post-challenge votes are all in favor of continuing to play extra challenges, why is that?

This is a much more subjective question, and of course each of the 456 players has their own rationale, debt, thresholds, risk aversions… But based on the game set-up we can take a stab at understanding how players approach an extra challenge.

Let’s first answer another question: if every player’s “value” is $100 million wons (value that gets added to the piggy bank if they die), how many deaths need to occur for the average payoff to be that same value?

Given the formula previously derived, we can easily see that if average payoff is equal to player value, then number of deaths / (456 - number of deaths) must be equal to 1, meaning there are as many deaths as survivors, and so exactly half the initial players died. 228 players!

What about if players want to leave with double their value, or triple, or multiply by 10? If we define f as the factor of how many multiples of their value survivors leave with, we have:

f = number of deaths / (456 - number of deaths) which w can flip to:

number of deaths = 456 * f / (f + 1)

So to double payoff, 304 deaths would be required which is only 76 more than the 228 that were required to get the player value as payoff!

And so the minimum number of deaths required to get multiples of the $100 million player value is:

Factor	Survivor Payoff (million wons)	Minimum number of deaths	Incremental deaths required
1	100	228	228
2	200	304	76
3	300	342	38
4	400	365	23
5	500	380	15
10	1000	415	35

And we can see the full visual here:

It takes fewer and fewer deaths to get significantly higher payoffs, which is of course another interpretation of the very first hockey stick figure we saw.

Here's an alternative ingographic-style view:

Back to our question, why do players keep voting for an extra challenge despite the risk lethality?

Based on our previous work, we can split the decision-making into two phases:

Phase 1

Players in the game usually have rather high debts, and payoffs are really horrible at the very beginning given how many players are still alive to divide up the piggy bank amount. In this phase payers need to continue to actually get something out of having risked their lives.

Phase 2

At some point, quite a few survivors could leave with a payoff sufficient to cover their debts, so the incentive to stay should decrease? But another phenomenon kicks in, the average payoff is starting to skyrocket. Every time a few extra players die, the average payoff starts multiplying! It is therefore likely that many players will start thinking along the lines of “I survived 342 other deaths which is a survival rate 25%, surely I can survive another 23 deaths of the remaining 114 players (survival rate of about 80%), and leave with quadruple the player value instead of just triple!”

The organizers of the games have masterfully set-up the payoff system to generate and combine these two psychological effects in order to ensure games would continue through many challenges to the greatest delight of the VIP watchers… and to all Netflix subscribers!

Tuesday, December 17, 2019

Can the 2019-2020 Lakers get 70 wins?

I know I know, it's way too early to have this conversation.
If we're already having it with less than a third of the season under our belts, can you imagine what it will be like for the rest of the way? "70" might be a top Google search in 2020!

That being said, the conversation has already started, here's the record for the '71 Lakers, blablabla, very similar, blablabla... So it's difficult to block it out entirely and as a big fan of visualization, even more difficult not to add a couple of visuals to avoid all the number comparisons we're seeing.

Very quick background: the NBA regular season consists of 82 games. 70 was a mythical number of wins no team seemed able to reach, the Lakers getting the closest with 69 in 1972. That all changed in 1996 when Michael Jordan's Bulls went on to win 72. Twenty years later, the Golden State Warriors were able to inch just a bit further with 73 wins which is the record as of today. (People will routinely point out that while the Bulls won the championship that year, the Warriors lost the Finals in 7 games...). The year after they got 72 wins, the Bulls were close to repeating the 70-win feat but lost their last two games and ended at 69. They did however win the championship again that year.

This year, the Los Angeles Lakers are off to a really hot start, and after the Bucks' loss yesterday, lead the NBA with a 24-3 record. This hot start has naturally fueled the 70-win conversation, so how do all these teams stack up at this point of the regular season?

Here's a game-by-game evolution of those four teams' win percentages:

The gray horizontal line is 83.4% representing the 70 win threshold. What really stands out here is the overall downward trend as the season progresses. It took a while for the '71 to get above the 70-win threshold , but once above it seems a very difficult level to maintain. No team was just under the limit to finally make it above in the final stretch of the season.

While the Lakers are currently tied with the '95 Bulls, '96 Bulls and '71 Lakers, they will need to keep accumulating wins at a higher than 83.4% win rate to provide an acceptable cushion for the downward end of year trend.

As for the rationale behind that downward trend, it could be due to fatigue, although one could argue that all teams should be similarly affected, but more likely it is due to final Playoff standings starting to fit in. The 7th, 8th, 9th and 10th seed are all fighting for the last two Playoff spots and will do whatever it takes to win.

It could very well be that LeBron and company will have to make some late season decisions as to whether they want to pursue the 70-win mark or maximize their chances for title. Officially they'll declare the latter as being their priority, but we all know that if LeBron can add a 70-win season to his resume he wouldn't spit on that. He also knows first-hand the risk associated with chasing the wrong priority, as his team was the one who beat the 73-win Warriors....

Tuesday, December 3, 2019

Lego: Christmas gift without bricking the bank

Four years ago right around Christmas time, I had posted an analysis of how the prices of Lego sets were related to their number of pieces.

Checking a Lego Christmas catalog last week, I had the feeling that:

prices had gone up in general
yet the 10-pieces-per-dollar ratio I had estimated 4 years ago still appeared to be valid.

This conclusion would then naturally be that the number of pieces had also gone up.

I pulled all the prices and number of pieces by Lego theme (over 40 nowadays!), and revisited the analysis.

The data needed some clean-up again, especially regarding sets with with abnormal prices for the number of pieces: the Duplo sets (oversized blocks are much more expensive), the Mindstorms sets (the color sensor is over $40) and Powered-Up sets (the move hub is $80). Similarly to last time, I also split themes into those that are based on an external partnership (Star Wars, Harry Potter, Marvel, DC...) and those entirely developed by Lego (City, Friends, Creator Expert...).

Here's the plot of pieces versus price for all sets less than $200 (95% of sets), and removing the outliers mentioned in the previous paragraph:

It's quite tempting to look at the range for the pieces-to-price ratio across sets, splitting by theme:

Out of the top 10 themes with best median ratio, 9 are themes Lego developed in-house, and only one is through external partnership (car makers, deal might not have been as lucrative as for Marvel or Star Wars rights).

We can fit a linear model to fit the number of pieces against price, partnership, and their interaction:

The result is that for non-partnership sets, we get a pieces-to-price ratio of approximately 10.7, ratio that drops to just over 9 for the partnership sets. The difference is highly significant and confirms our previous observations. The intercepts are slightly different but not significantly so.

A question we might be interested in is whether we observe any diminishing or increasing returns as the prices go up. This can be done by observing the pieces-to-price ratio for various pice buckets:

As we might have expected, our pieces-to-price improves as we go into the more expensive sets. This makes sense as the pieces are only one of the costs for sets, there are many other fixed costs in terms of packaging and marketing that will make the smaller sets less attractive to the smart buyer.
Non-partnership sets appear to always outperform the partnership sets, but one should keep in mind that the sample size is pretty small in the $200+ bucket.
The previous result also throws some shade on our linear models, the slopes (pieces-to-price ratio) are actually not constant. If we wanted to improve the models, we would probably need to include a quadratic term for price.

So to recap, your best pieces per dollar will come from non-partnership and bigger sets. Of course it could be your child has a specific theme in mind, so to help out with your Christmas purchases, here's the list of the best sets for each theme:

Theme	Product Name	Pieces	Price	Ratio
Architecture	trafalgar-square	1197	79.99	14.96
Boost	droid-commander	1177	199.99	5.89
Brickheadz	thanksgiving-scarecrow	177	9.99	17.72
City	pizza-van	249	19.99	12.46
Classic	bricks-bricks-bricks	1500	59.99	25.00
Creator-3-in-1	deep-sea-creatures	230	14.99	15.34
Creator-Expert	taj-mahal	5923	369.99	16.01
Dc	1989-batmobile	3306	249.99	13.22
Disney	the-disney-castle	4080	349.99	11.66
Disney-Frozen-2	elsa-s-magical-ice-palace	701	79.99	8.76
Duplo	creative-fun	120	39.99	3.00
Fantastic-Beasts	newt-s-case-of-magical-creatures	694	49.99	13.88
Friends	underwater-loop	389	29.99	12.97
Harry-Potter	hogwarts-castle	6020	399.99	15.05
Hidden-Side	shrimp-shack-attack	579	49.99	11.58
Ideas	lego-nasa-apollo-saturn-v	1969	119.99	16.41
Juniors	city-central-airport	376	49.99	7.52
Jurassic-World	jurassic-park-t-rex-rampage	3120	249.99	12.48
Lego-Batman-Sets	1989-batmobile	3306	249.99	13.22
Lego-Spider-Man	spider-mech-vs-venom	604	49.99	12.08
Marvel	the-hulkbuster-smash-up	375	29.99	12.50
Mindstorms	lego-mindstorms-ev3	601	349.99	1.72
Minecraft	the-wool-farm	260	19.99	13.01
Minifigures	mf-set-ninjago-2019	59	12.99	4.54
Ninjago	ninjago-city	4867	299.99	16.22
Overwatch	dorado-showdown	419	29.99	13.97
Power-Functions	lego-power-functions-train-motor	7	13.99	0.50
Powered-Up	disney-train-and-station	2925	329.99	8.86
Powerpuff-Girls	mojo-jojo-strikes	228	29.99	7.60
Serious-Play	window-exploration-bag	4900	484.99	10.10
Speed-Champions	formula-e-panasonic-jaguar-racing	565	29.99	18.84
Star-Wars	yoda	1771	99.99	17.71
Stranger-Things	the-upside-down	2287	199.99	11.44
Technic	cherry-picker	155	9.99	15.52
The-Lego-Movie-2	pop-up-party-bus	1024	79.99	12.80
The-Lego-Ninjago-Movie	ninjago-city	4867	299.99	16.22
Toy-Story-4	woody-rc	69	9.99	6.91
Unikitty	unikitty-cloud-car	126	9.99	12.61
Xtra	traffic-lights	46	3.99	11.53

Now one important caveat should be mentioned here. Number of pieces is a great proxy for size and time to build, but it isn't perfect. Going through some sets I've noticed a tendency of adding more and more small decorative pieces. This trick can significantly and artificially bump up the piece number. Ideally we should also include weight to control for that aspect, but weight was not available and would probably not have been available at the desired granularity level. Take the new gingerbread house for instance. At $99 for 1477 pieces it seems like a great deal, but take a closer look at what's inside:

Look at all those small pieces! Especially the red and white candy canes around the doors and windows. Just tiny 1-by-1 round shapes staked together. Almost 100 of them, and I doubt they're worth $10!

Now of course, these were just models, and you can throw them all out the window when it comes Ole Kirk's house. Who? Ole Kirk as in Ole Kirk Christiansen, the man who created The Lego Company. His house is an extremely rare Lego set which was gifted only to employees (and a few special visitors). At 912 pieces we would roughly estimate it at $90, but good luck finding it at less than $300, prices of $500 are not uncommon for unopened sets!

Thursday, November 21, 2019

Twitter, Trump & the Economy

Mentioning Twitter and Trump in the same sentence has become a lapalissade over the last few years.

It has reached the point where each new tweet is logged, analyzed, correlated. For a full comprehensive archive, I strongly recommend the Trump Twitter Archive. They're all there available for any analysis you can think of!

I recently stumbled across multiple articles on how the number of tweets from Trump were negatively correlated with stock market performance. A few examples:

The papers don't say it explicitly, but highly suggest causality, something we'll get back to later.

Reading all this, I decided to take a quick stab and see if:

there were any interesting trends in Trump's Twitter usage that hadn't already been reported
could replicate some of the stock market findings

Twitter activity

Much has already been said on Trump's Twitter patterns, and using activity as a proxy for determining when he sleeps, here are heatmaps for orginal tweets, retweets, total tweets and fraction of retweets:

Very dark from midnight to 6am as one would expect. We can also spot some sweet spots for original content on Monday mornings to kick off the week, and that same day right before midnight (lighter color zones in the upper right side graph).

I did want to explore the evolution of the fraction of retweets over time (# retweets / # total tweets). Very noisy data as one can imagine so slapped a smooth curve on top and the result is quite striking:

I'm guessing the retweet data wasn't collected prior to 2016, but since being elected there has been a clear upward trend.

It could be a sign that less time is available to generate a tweet from scratch and retweeting is a simpler way of manifesting presence, or it could be a way of building support ahead of the upcoming elections. It will definitely be interesting to continue monitoring this metric over the next few months and see if we ever cross the 50% mark.

Stock market performance

The original intent was to re-run the analysis mentioned ion various articles about the negative correlation between number of tweets and stock market performance.

The articles were not entirely clear on what data was being used, the best description I could get was:

"since 2016, days with more than 35 tweets (90 percentile) by Trump have seen negative returns (-9bp), whereas days with less than 5 tweets (10 percentile) have seen positive returns (+5bp) — statistically significant."

So time period will be Jan 1st 2016 until today (a little more data than what was originally used). In terms of tweets, it is not entirely clear if retweets are included or not, I will test both alternatives. As for the returns, I will look at three market indicators: Dow Jones, S&P500 and Nasdaq. Two more alternatives were possible based on whether we inspect changes in market performance in terms of absolute or relative changes.

Similarly to the quote, I identified in each case the 10th and 90th tweet percentile, and ran a simple t-test for market performance on days with less than the 10th percentile versus days above the 90th percentile. Unfortunately, in none of the 12 cases was I able to identify a significant relationship between Trump's Twitter activity and market performance:

There could be multiple reasons why I wasn't able to replicate the findings, from the data source (a good indication of this comes from the fact that my 10th and 90th quantile are lower than the values reported in the articles) to the formatting (I used Eastern time as the reference time but using a different time zone could result in tweets being shifted to another day) to the technique used for detecting a difference (though I also ran other non-parametric tests and results did not change).

One could be surprised by the lack of significance when seeing that Nasdaq has an average uplift of 0.2% on low tweet days but only 0.06% on high tweet days, but this is easily explained when visualizing the actual distributions and large standard deviations:

Independently of all these potential causes though, I feel that more importantly than the result is the interpretation of the result.

Correlation VS Causation

Let us assume we had surfaced an incredibly tight correlation between the two metrics: the number of tweets by Trump and market performance. What would that have meant, if anything?

When correlation between two variables A and B is observed, a lot could be going on behind the scenes:

A and B have nothing in common and the observed correlation is purely coincidental. This is called a spurious correlations. One of my favorite examples is the correlation between "the number of people who drowned by falling into a pool" and "the number of movies Nicolas Cage appeared in" (example taken from https://www.tylervigen.com/spurious-correlations which has lots of other great ones!)

It might seem that A causes B but it could actually be the reverse. For instance, it could seem that people going more frequently to the gym have lower BMIs, but perhaps those with lower BMIs tend to go more frequently to the gym...
The most vicious situation is that of the confounding or hidden variable that has a causal impact on both A and B separately. We might see a link between carrying a lighter and risk of lung cancer, but neither affects the other, the real hidden variable is whether the person is a smoker or not.

Back to the tweets. Assume there is a strong significant correlation between volume of tweets and market performance, one could argue there are other explanations than sheer volume creating market panic. Perhaps it is the reverse effect: when markets perform poorly, Trump is more likely to tweet and attempt to re-assure. Or there could be hidden variables in other economical events: if there are trade tensions with other countries, Trump could be more likely to tweet about those while in parallel the market tends to panic and go down...

Anyway, this was just a little tweet for thought...

Tuesday, September 10, 2019

How elusive was 42 as the sum of three cubes?

The big mathematical news September 9th 2019 was the final resolution of the decade old Diophantine equation.

As is often the case for super tough questions, the question itself seems completely trivial.
Can all the numbers from 1 to 100 be written as the sum of the cubes of three integers,
i^3 + j^3 + k^3?

Of course the key part of the question is integers, otherwise 0^3 + 0^3 + (cubic root of x)^3 will return x for any x we want!

The other subtlety is that integers can be negative, so each cube can be either positive or negative. So 92 can be written as 9^3 + (-8)^3 + (-5)^3.

This puzzle has been around for at least decades in this current form, but similar equations with integer solutions have been around at least since the third century!

Four our problem, solutions for almost all numbers 1 through 100 had been found, but two remained elusive 33 and 42.

And then 33 was found earlier this year:
(8866128975287528)^3
+ (−8778405442862239)^3
+ (−2736111468807040)^3 = 33

Which left 42, but advances on the theory and on the computing side have cracked the last remaining elusive number:
(80435758145817515) ^ 3
+ (-80538738812075974) ^ 3
+ (12602123297335631) ^ 3 = 42

Now I will describe man algorithm to generate all numbers 1 through 1000....
Just kidding.

I was actually curious of the elusiveness of these numbers 33 and 42. Mathematicians had to go really far to get the right numbers. Is that because numbers just weren't very likely to fall in the 1 to 100 range (very likely given how quickly cubes get!) or are certain numbers much more likely to get all the attention. in other words, as all combinations of three integers were tested, were all numbers 1 through 100 obtained in roughly equal proportions or did frequency concentrate around certain ranges?

I wrote a small script which explored this, simply generating all possible combinations of three integers (up to -1000 to 1000 in range, didn't feel like going all the way up to 80538738812075974...) and look at the distribution of i^3 + j^3 + k^3 in the 1 to 100 range.

With i,j,k in -10 to 10

With integers just in that small range, already 61 out of the 100 numbers are solved!
The distribution is far from being randomly uniform though! Two things jump out:

the spikes: These are perfectly normal! If a number is a perfect cube (i.e. 27), then there is an infinite number of solutions setting the two other values to opposite values: 27 = 3^3 = 3^3 + 0^3 + 0^3 = 3^3 + (1761)^3 + (-1761)^3... And you can add in all the permutations!
the clusters: it seems the solved numbers go in groups. Again, quite reasonable if we think about it. If a number can be generated as the sum of two cubes, then we can easily generate the one immediately before and after by adding either 1^3 or (-1)^3. We have an extra degree of freedom around the perfect cubes. Being able to generate consecutive solved numbers creates the clusters. There is a further discussion at the end of the post on how unexpected the number of clusters is.

Based on the spike comment, I will truncate at frequencies of 10 to focus on coverage:

With i,j,k in -100 to 100

Expanding the range ten-fold leads to 68 out of the 100 being solved:

With i,j,k in -1000 to 1000

Increasing the range ten-fold once again leads to a single additional number being solved: 51

We now have a better sense of the elusiveness of the solutions. If they aren't found with the range of the smaller integers, then it will get increasingly difficult to identify later, no wonder this has been such an open problem for so many years!

Just for fun, I looked at a new problem switching the power from 3 to 5, only 13 (compared to 69 for cubes) numbers were solved in the -1000 to 1000 range:

Again the clustering is pretty remarkable! While 42 is still unsolved, I'm glad to report a solution for 33: 0^5 + 1^5 + 2^5 !

And to close out this post, I wanted to return to the topic of the clusters. We saw that when exploring the -1000 to 1000 ranges for the integers we were able to cover 69 out of the 100 target numbers. But it didn't appear as though the 69 numbers were randomly distributed in 1-100 but grouped into clusters, more specifically into 13 clusters. We had a high-level explanation for this as consecutive numbers can easily be generated via k^3 with k = -1, 0 and 1. But the question remains, how non-random is this grouping into 13 clusters?

One approach is to sample without replacement 69 out of the 100 first integers, and see how many clusters are obtained. In order to count clusters, I used the rle() function which stands for run length encoding and basically describes a sequence. E.g. for 1,2,2,2,1,0,0 it will return that there is one 1, three 2s, one 1 and two 0s. From there it is easy to get the number of clusters.

I generated 10,000 samples of "solved" numbers between 1-100 and in each case computed the number of clusters obtained. Here is the associated histogram to which I've added the 13 we obtained in our case as a red vertical line:

While we'd typically expect to get about 20-25 clusters, only in extreme cases do we get a 15 or even a 14. Our 13 was not obtained a single time in all ten thousand simulations. This should settle the question of whether the low number of clusters comes as a surprise or not!

Friday, August 30, 2019

Disney vs Fox: Movie Performance Comparison

A few weeks ago 'Dark Phoenix' was a very hot conversation topic in the movie business, not for the movie itself, but because it symbolized the difficult Fox acquisition by Disney.

While Disney movies have done fairly well in 2019 to put it lightly (5 movies generating over $1Billion so far, and it's only August with "Frozen 2" and "Star Wars 9" still due....), Fox is under very tight scrutiny and 'Dark Phoenix' has been heralded as the barometer for Fox movie success/failure. Especially after Disney CEO Bob Iger declared during quarterly earnings call that “the Fox studio performance … was well below where it had been and well below where we hoped it would be when we made the acquisition.”

While technically profitable thanks to higher-than-expected international results (latest IMDB results have $250M gross worldwide revenue with analyst forecasts around $280M for an estimated budget of $200M), 'Dark Phoenix''s performance will continue to be observed and scrutinized.

But this is just one movie we're talking about! Is it really fair to reduce a movie studio the size of Fox to a single movie? 20th Century Fox has been around for a while, and I was curious to compare their historical performance compared to Disney's.

First things first, data is required! I pulled all of Disney's and Fox's movies from IMDB, including key metrics in the process: IMDB rating, metascore, duration, genre, estimated budget, gross revenue...

Naturally some filtering was required to focus solely on non-short non-documentary movies released after 2000 with non-missing data.

In addition to Fox and Disney movies, I created a third category for movies produced by both.

The next plot compares the performance of each of the 512 movies that matched are criteria (331 Fox, 181 Disney), the x-axis is the worldwide revenue, y-axis is IMDB rating. Bubbles are color-coded by company and the size of the bubble is proportional to the movie's ROI (gross worldwide revenue / estimated budget):

A log-transform of the x-axis provides more insight into the lefthand-side blob, but loses the magnitude of the super high-performing movies:

It would appear that Disney dominates the high revenue end of the spectrum, with 22 out of 23 of movies generating over a billion dollars coming from Disney, the sole exception being 'Transformers: Age of Extinction', things are much more murky under that threshold.

Given the difficulty of separating the two classes I tried fitting an XGBoost model which was capable of classifying movies from each company solely based on IMDB rating, metascore, budget, world revenue and movie duration with over 75% accuracy on the test dataset, pretty remarkable, wouldn't you agree? Although to be fair, a simple logistic regression was capable of achieving a reasonable 65% accuracy.

It'll be interesting to see how the next wave of Fox movies perform, perhaps attempt a before/after comparison of the Disney acquisition...

Friday, June 28, 2019

Solving Mathematical Challenges with R: #3 Sum of all grids

Welcome to our third challenge!

You can find more details on where these challenges come from and how/why we will be approaching them in the original posting.

Challenge #3

The original link to the challenge (in French) can be found here. The idea is to consider an empty three-by-three grid, and then place two 1s in any two of the nine cells:

The grid is the completed as follows:

pick any empty cell
fill it as the sum of all its neighbors (diagonals as well)

I've here detailed an example, the new cell is indicated at each step in red:

The challenge is now to find a way to complete the grid in such a manner as to obtain the greatest value in a cell. In the previous example, the maximum obtained was 30, can we beat it?

Power grid

I strongly encourage you to try out a few grids before continuing to read. Any guesses as to what the greatest obtainable value is?

As you try it out, you might realize that the middle cell is rather crucial, as it can contribute its value to every other cell, thus offering the following dilemma:

fill it too soon and it will be filled with a small value
fill it too late and it will not be able to distribute its value to a sufficient number of other cells

Gotta try them all!
In the spirit of our previous posts, let's try all possibilities and perform an exhaustive search.
The first thing we need to ask ourselves is the number of possible ways of filling up the square, and then generating all those combinations in R.
Ignoring how the cells are filled up, there are 9 possibilities to fill up the first cell (with a 1), 8 possibilities for the second (also a 1), 7 for the third (with the rule)... and 1 for the ninth final cell. Which amounts to 9*8*7*...*1 = 9! = 362880, piece of cake for R to handle.

Generating permutations
So we want to generate all these possibilities which are actually permutations of numbers 1 through 9, where each number corresponds to a cell. {7,1,5,2,4,9,3,8,6} would mean filling up cells 7 and 1 first with a 1, then filling up cell #5, then #2 and do on.
A naive method would be to use expand.grid() once again for all digits one through 9, which would also give unacceptable orders such as {7,1,5,2,1,3,6,8,9} because cell #1 is filled twice at different times! We could easily filter those out by checking if we have 9 unique values. But this would amount to generating all 9*9*9..*9=9^9>=387 million combinations and narrow down to just the 362K of interest. That seems like a lot of wasted resources and time!

Instead, I approached the problem from a recursive perspective (R does have a package combinat that does this type of work, but here and in general in all the posts I will try to only use basic R):

If I only have one value a, there is only a single permutation:{a}
If I only have two values a and b, I fix one of the values and look at all permutations I have of the left-over value, then append the value that was fixed:
- fix a, all permutations of b are {b}, then append the a: {b,a}
- fix b, all permutations of a are {a}, then append the b: {a,b}
- the result is {a,b} and {b,a}
with three values, repeat the process now that we know all permutations of 2 values:
- fix a, all permutations of b,c are {b,c} and {c,b}, then append the a: {b,c,a}, {c,b,a}
- fix b, all permutations of a,c are {a,c} and {c,a}, then append the b: {a,c,b}, {c,a,b}
- fix c, all permutations of a,b are {a,b} and {b,a}, then append the c: {a,b,c}, {b,a,c}

In R:
SmartPermutation <- function(v) {
if (length(v) == 1) {
return(as.data.table(v))
} else {
out <- rbindlist(lapply(v, function(e) {
tmp <- SmartPermutation(setdiff(v, e))
tmp[[ncol(tmp) + 1]] <- e
return(tmp)
}))
return(out)
}
}

Computing the score

The next and final step is to fill out the grid for each possible permutation.

This is done by defining a static mapping between cell ID and its neighbors:

We then go through each permutation, fill out a virtual grid (in our case it's not a 3-by-3 but a vector, which doesn't change anything as long as the relationships between neighbors is maintained) and track maximum value.

Neighbors <- function(i) {
neighbor.list <-
list(c(2, 4, 5), c(1, 3, 4, 5, 6), c(2, 5, 6),
c(1, 2, 5, 7, 8), c(1, 2, 3, 4, 6, 7, 8, 9), c(2, 3, 5, 8, 9),
c(4, 5, 8), c(4, 5, 6, 7, 9), c(5, 6, 8))
return(neighbor.list[[i]])
}

CompleteGrid <- function(row) {
grid <- rep(0, 9)
grid[row[1 : 2]] <- 1
for (i in 3 : 9) {
grid[row[i]] <- sum(grid[Neighbors(row[i])])
}
return(max(grid))
}

Final answer
Based on our exhaustive search, we were able to identify the following solution yielding a max value of...57!

Little bonus #1: Distribution
The natural question is around the distribution of values? Were you also stuck around a max value in the forties? 57 seems really far off, but the distribution graph indicates that reaching values in the forties is already pretty good!

Little bonus #2: Middle cell
Do you remember our original trade-off dilemma? How early should the middle cell be filled out? In the optimal solution we see it was filled out exactly halfway through the grid.
To explore further, I also kept track for each permutation when the middle cell was filled out and what its filled value was. The optimal solution is shown in green, the middle cell being filled out in 5th position with a seven.

And a similar plot of overall max value against position in which the middle cell was filled:

It clearly illustrates the dilemma of filling neither too late nor too early: some very high scores can be obtained when the middle cell is filled as 4th or 6th cell, but absolute max of 57 will only be achieved when it is filled in 5th. Filling it in the first or last three positions will only guarantee a max of 40.