Comments on E's flat, ah's flat too: How to distinguish fake coin tosses...

I did the test of gzip. I randomly generated 10000...

2016-05-13T23:25:18.952+05:30

I did the test of gzip. I randomly generated 10000 different 100-length string and zipped them. The length of zipped result follow a normal distribution with mean=43.7683 and std=1.852. The first string's zipped result length is 39 (pdf=0.00783, and 1.66% of the test results) and the second is 43 (pdf=0.19765 and about 17.44% of the test results).

Also, I did the full test for P(Xn | X_(n-1) ). the result is:
(format: [[P(t|t), p(h|t)], [p(t|h), p(h|h)]])
for string 1:
[[38, 22], [22, 17]]
for string 2:
[[14, 31], [31, 23]]

Further, I proposed another method to test the randomness
first, if we interpret the sequence as a sequence of 1-d points, we could count the density of all possible 1-d points.
For example, for sequence [0,0,1,1], there are two 1-d points. 0 occurs twice and 1 occurs twice, and the result follows the uniform density along all possible 1-d points.

Then, what if we interpret the sequence as a higher-dimensional points? For example, we can treat [0,0,1,1] as two 2-d points (0,0), (1,1) and the result is not uniform along all 2-d points [(0,0), (0,1), (1,0), (1,1)].

If a sequence is truly random, then it should follow the uniformness along all dimensional interpretation.

Therefore, I did the test of 2-d interpretation on your sequences.
(format: [P([0,0]), P([0,1]), P([1,0]), P([1,1])])
first sequence:
[0.5, 0.16, 0.18, 0.16] std=0.14
second sequence:
[0.22, 0.3, 0.3, 0.18] std=0.052

From those results, I think the second sequence has more probability to be "truly random".

Anonymous, nice point, and am wondering how you ca...

2014-03-21T12:31:53.022+05:30

Anonymous, nice point, and am wondering how you came across this blogpost after so many years!

Another test of randomness that I think might be p...

2014-03-21T12:12:40.491+05:30

Another test of randomness that I think might be particularly likely for humans to fail at is that the probability of choosing the next result is independent of the previous result. Humans will have a hard time making this true. P(N=H | N-1=H) will almost certainly not equal P(H). On your data set, sequence 2 has the probability that the next result is the same as the previous is only 39/99, much less than the ideal 49.5/99. Sequence 1 on the other hand has a 55/99 chance that the next result is the same as the previous, much closer to the idea. A quick simulation suggests that only 456/10000 pseudorandom runs have a conditional probability as far from ideal as 39, whereas 3090/10000 have conditional probability as extreme as 55. The implied p-value for Sequence 1 being non-random is 0.31, whereas the implied p-value for Sequence 2 being non-random is 0.046, just low enough to reject Sequence 2's randomness if the standard cutoff of 0.05 is used. A longer sequence would of course narrow the spread on the truly random sequences, while probably not changing too much for the human, so this method should be better and better for longer sequences. Alternatively it should also perform better on more naive humans, who will tend even more to want to avoid runs. It looks like 100 flips is a borderline case for sophisticated humans.

Rahul Thanks. I was hoping I would be right, or mi...

2010-08-21T23:15:18.538+05:30

Rahul
Thanks. I was hoping I would be right, or might've looked quite silly with that answer :)

For the algorithm:
Collect a large number of human-generated sequences and an equal no of computer generated sequences in a database (the more the better). Then use bayesian methods (which i am not yet very familiar with) to determine probability of the given input sequence being random.

Prithwiraj: nice answer and you're right. The...

2010-08-21T08:50:30.865+05:30

Prithwiraj: nice answer and you're right.

The next question is, can one write a computer program to do the job? The program should take an input sequence, and return the probability that it was generated by a human (or other nonrandom process); and its confidence should increase with the size of the input.

crude answer: sequence 1 has 60H and 40T and start...

2010-08-21T02:06:59.113+05:30

crude answer:
sequence 1 has 60H and 40T and starts with T
sequence 2 has 54H and 46T and starts with H

very likely you did sequence 2 by hand, and subconsciously
1.tried to have close to 50% tails. in reality, 50-50 would occur in much larger samples
2. began with H

Thanks, I completely missed the distinction betwee...

2010-08-19T13:21:34.794+05:30

Thanks, I completely missed the distinction between HHHH.. and THHH...

km - ps: about Deolalikar, I do find it disappoint...

2010-08-18T11:15:14.209+05:30

km - ps: about Deolalikar, I do find it disappointing that, after having obtained a huge amount of feedback from eminent people (including at least two Fields Medallists) for his earlier manuscript, he has chosen to restrict his newer manuscript to a "small number of researchers", and -- at least on his web page -- fails to acknowledge the usefulness of any of the previous feedback.

If he is really on to something, I would think the open method would be best.

km -- it's a bit beyond me, and others are doi...

2010-08-18T11:09:09.617+05:30

km -- it's a bit beyond me, and others are doing a great job -- in particular Richard Lipton. The current consensus seems to be that the proof is flawed, probably fatally so, and the remaining question is whether any interesting results can still be retrieved.

Anonymous: well, that article also talks about heads specifically, not "heads or tails". But the approach is similar to what I used, except I build it up 5 at a time rather than 1 at a time, and get a formula rather than write a computer program.

Kapil -- I'll hold off commenting yet on whether you are right, but -- as you know -- a number like 62 or 48 is not very significant without knowing how much it will vary. So for example, if you generated 1000 random 100-long sequences of H/T, gzipped them all, and calculated the mean/variance of the zipped files, and then found that one of my sequences was well within your expected range and the other was well outside, then that would be pretty convincing...

as - no, that is why I am writing sequences like THHHHHHNNN -- to avoid counting examples that were already counted in HHHHHHNNNN.

I just looked up Feller(Vol 1. Sec XIII.8) and he ...

2010-08-18T07:23:54.182+05:30

I just looked up Feller(Vol 1. Sec XIII.8) and he discusses the case of consecutive H or T and writes down an explicit formula for mean time before we get such a run. Am confident that the formula exists for the probability and is hidden in a mid term question heap in the Princeton basement.

Thank you for putting up a solution. One question though, I was wondering if you were over counting a little bit, as in when looking from N-5 to N+5, how many times would you be counting solutions like HHHHHHHHHH(10 times). Maybe, I missed something.

A fairly standard test for randomness is Maurer...

2010-08-18T07:08:04.902+05:30

A fairly standard test for randomness is Maurer's test which checks
how compressible the sequence is. The problem with that test is that
it takes too long on data-sets that are large enough and it is not
significant on data-sets that are small! Still, why not use it?

Anyway, I just took your seq1 and seq2 and applied 'gzip -9' on them.
Here is what I got:

58 /tmp/seq1.gz
62 /tmp/seq2.gz

I then applied 'lzma --best' on both.

47 seq1.lzma
48 seq2.lzma

This seems to indicate that the first sequence is less random than
the second.

I begin to lose you in the third section, but how ...

2010-08-18T00:02:45.823+05:30

I begin to lose you in the third section, but how about this for a precise answer?

http://www.win.tue.nl/~iadan/blockq/rows.pdf

I didn't see you blog about Deolalikar's N...

2010-08-17T19:17:34.274+05:30

I didn't see you blog about Deolalikar's N = NP? Any thoughts?