Chat or Talk in the INReview Discussion Forum Chat or Talk in the INReview Discussion Forum
Support INReview. Please visit our sponsors and shop.
 
register chat shopping members links refer search home
INReview INReview > Archives > Science > Mars > Help crack the code!
Search this Thread:
Pages (3):  [1] 23 »   Print Version | Email Page | Bookmark | Subscribe to Thread
Author
Thread Post New Thread   
Staff
Sean Kelly
Free Thinker

offline
Registered: Jan 2003
Local time: 07:17 PM
Location: Silicon Valley
Posts: 4292

Help crack the code! post #1  quote:



This invitation is extended to anyone interested in helping crack the code. If we get it figured out, we can all submit the result and be entered into the drawing to win the silly prizes.



Here's the contest URL: https://planetary.org/redrover-dvd/dvd_spirit.html

I'll work on putting together some more materials here and begin the list of clues and possible thoughts for code-breaking, possibly later tonight.

EDIT: Perhaps I'll end up doing this by myself here, but feedback & input is welcome. Hopefully the incarnations that I go over will be helpful to someone new to data decryption & ciphers who is interested in the subject and would like to know more about what kinds of methodologies may be employed.



Smile; It confuses people.
Old Post 01-10-2004 12:33 AM
Click here to Send Sean Kelly a Private Message Find more posts by Sean Kelly Add Sean Kelly to your buddy list Reply w/Quote

Staff
Sean Kelly
Free Thinker

offline
Registered: Jan 2003
Local time: 07:17 PM
Location: Silicon Valley
Posts: 4292

post #2  quote:




Smile; It confuses people.
Old Post 01-10-2004 12:38 AM
Click here to Send Sean Kelly a Private Message Find more posts by Sean Kelly Add Sean Kelly to your buddy list Reply w/Quote

Staff
Sean Kelly
Free Thinker

offline
Registered: Jan 2003
Local time: 07:17 PM
Location: Silicon Valley
Posts: 4292

post #3  quote:

Just a quick look at some numerology analysis:

The code repeats three times, and is itself comprised of a repeating pattern, in two tracks, made up of 3 distinct symbols: dot, dash and bar. If the symbols are translated into numerical representations so as to be more easily read where dot=2, dash=1, bar=3, then the two tracks look like this:

TRACK 1 - [123212321232123212321]
TRACK 2 - [321232123212321232123]

Track 1's repeating pattern is "1232"
Track 2's repeating pattern is "3212"

It patterns in repetition produce the exact same stream of values except that they are offset by 2 positions. Out of 4 total positions in each pattern, 2 works out to exactly half the pattern being shifted.

The result of adding coinciding positions from each track is always 4. Track 1, position 1 = [1] where track 2, position 1 = [3].. [1] + [3] = 4. For position 2 we get 2+2=4, for position 3 we get 3+1, and so on. It is unlikely that the number 4 is significant in the final outcome because we arbitrarily selected the numbers 1,2 and 3 to represent the symbols sumply for analysis, and there is no evidence that the coders would have selected the same numbers, or even numbers at all.

Now what if we changed our numbers from 1,2,3 ro 1,2,4 - the first three significant digits in base-2 binary system. Then the tracks would show:

TRACK 1 [124212421242124212421]
TRACK 2 [421242124212421242124]

In this case, the sum of each posision alternates between 5 and 4.

Still not very interesting.

Now what if we looked at the tracks as if they are data sets. Then the values might represent an plotted position on a graph over time. If this is the case, and the values of track 1 are plotted out, then we get:

4] __*___*___*___*___*__
2] _*_*_*_*_*_*_*_*_*_*_
1] *___*___*___*___*___*

track 2 would look like this:

4] *___*___*___*___*___*
2] _*_*_*_*_*_*_*_*_*_*_
1] __*___*___*___*___*__

These appear to be triangle waves which are among the most simple waveforms to replicate. Triangle waves, as with any waveform really, may be mathematically represented as a function of varying sine waves modulated upon one other. BUT they say it's supposed to be sophisticated, so it can't be that easy, right? Agreed. Though I will say that the code itself is too simple to represent anything too terribly sophisticated.

Now what if we took the two plotted tracks and laid them over one another? We would see this:

4] *_*_*_*_*_*_*_*_*_*_*
2] _o_o_o_o_o_o_o_o_o_o_
1] *_*_*_*_*_*_*_*_*_*_*

That looks like crap, bu it expresses the criss-cross pattern formed by the two triangle waves, with the "O" are the center of each intersection where they meet. Essentially, the pattern looks like a big XXXXXX rather than the individual \/\/\/\/\/ waves.

Still not very interesting.

Now what if we analyze the number of symbols in each track? Well there are 21 total symbols in each. We know the the number 21's denominators are 1, 3, 7 and 21. Well, 1 isn't a very interesting denominator because we tried breaking up the patterns into 1-digit sized chunks and didn't see anything meaningful. 21 isn't very useful because it's obvious that there is no way to decode something that represents only itself. So 3 and 7 are where we're at.

Now we can look at 3 chunks of 7 digit groupings to end up with:

TRACK 1 - [1232123] [2123212] [3212321]
TRACK 2 - [3212321] [2321232] [1232123]

But I still see a lot of patternous repetition in there that makes me think this is probably not very useful. But what if we break it into 7 chunks of 3..?

TRACK 1 - [123] [212] [321] [232] [123] [212] [321]
TRACK 2 - [321] [232] [123] [212] [321] [232] [123]

Here we see something interesting because it gives us four distinct sets of numbers, only three of which repeat at the end.

Knowing that LEGO is one of the sponsors, I'm tempted to assign alphabet characters to the groupings such that:

[123] = "L", [212] = "E", [321] = "G", [232] = "O"

The result would be:

[L][E][G][O][L][E][G]
[G][O][L][E][G][O][L]

"LEGO Leg" and "Go LEGO L"? Well it doesn't sound terribly meaningful, but I think "Go LEGO" is one of their catch phrases, is it not? Their website shows "PLAY ON" under their logo. I don't see "go lego" on their website anywhere, but google reveals that the phrase is used repeatedly, so perhaps it's an older catch phrase. I've been into Legos since about 1977, so I've seen most of their marketing materials over the years - it definitely rings a bell!

This could be close to something, but doesn't look like it's quite there!



Smile; It confuses people.
Old Post 01-10-2004 02:45 AM
Click here to Send Sean Kelly a Private Message Find more posts by Sean Kelly Add Sean Kelly to your buddy list Reply w/Quote

Staff
Sean Kelly
Free Thinker

offline
Registered: Jan 2003
Local time: 07:17 PM
Location: Silicon Valley
Posts: 4292

post #4  quote:

Okay, here's something odd. The photo of the disc on Mars which is the first one I posted does not contain the same code as the second photo. The first one distinctly shows THREE tracks of data which does NOT REPEAT as it does in the clear photo below. This leads me to think that the code to break is actually the code on the mars disc and not this friggen forgery or whatever it is..

Unfortunately, the mars photo is not clear enough to correctly transcribe the smaller code symbols with any degree of confidence. Now THAT looks like a much better code to try and crack..



Smile; It confuses people.
Old Post 01-10-2004 02:52 AM
Click here to Send Sean Kelly a Private Message Find more posts by Sean Kelly Add Sean Kelly to your buddy list Reply w/Quote

Staff
Sean Kelly
Free Thinker

offline
Registered: Jan 2003
Local time: 07:17 PM
Location: Silicon Valley
Posts: 4292

post #5  quote:

Okay, well one of the clues given on the originally linked page is that the code contains dots and lines - that's binary. That also means that the code can be transcribed from the original, crappy image sent from Mars because that much distinction can be made. It's clear that a three-level distinction would have been impossible, but this shouldn't be too bad, so here we go. Track 1 is the one closest to the edge of the disc:

TRACK 1 [1010011100011111100110001]
TRACK 2 [0110001111011100010100100]
TRACK 3 [1100001000110011001100001]

25 times 3 bits of data. Let's get crackin'.

And please let me know that I'm not just talking to myself here. COMEON - it's FUN, dangit!



Smile; It confuses people.
Old Post 01-10-2004 03:09 AM
Click here to Send Sean Kelly a Private Message Find more posts by Sean Kelly Add Sean Kelly to your buddy list Reply w/Quote

Staff
Sean Kelly
Free Thinker

offline
Registered: Jan 2003
Local time: 07:17 PM
Location: Silicon Valley
Posts: 4292

post #6  quote:

TRACK 1 [1010011100011111100110001]
TRACK 2 [0110001111011100010100100]
TRACK 3 [1100001000110011001100001]


taking 1 bit from each track in order of position and set into groups of three bits (which allows up to 16 possible values), we would get 25 sets. Suppose we wanted to have enough bits to transcribe each grouping into a letter of the alphabet, then our bit-word would have to be made up of 5 bits which yields 32 possibilities. This also works out conveniently because the total number of bits both cumulatively and per track are evenly divisible by 5. Of course this also means that the values 27 to 32 will be undefined, so if those values appear, we won't know what letter/character to assign to them. So this could be a wild goose chase, but it's worth the exercise. Breaking the tracks into 5-bit words, we get 15 words, five per track. Fortunately, since there are only 15 words, this means that it would be impossible to utilize all 32 combinations, in which case the subset may be further transposed/encrypted to add meaning to those values that we thought might have been meaningless, so let's take a look:

TRACK 1 [10100] [11100] [01111] [11001] [10001]
TRACK 2 [01100] [01111] [01110] [00101] [00100]
TRACK 3 [11000] [01000] [11001] [10011] [00001]

This just processes the information linearly. I figure if they want someone to crack the code who isn't a master hacker/cracker, that it'll probably read left to right. Now let's translate those bit strings into values so that they're easier to look at:

TRACK 1 [5] [7] [30] [19] [17]
TRACK 2 [6] [30] [14] [20] [4]
TRACK 3 [3] [2] [19] [25] [16]

So we can see a couple repeats in there being 19 and 30. When it comes to eyeballing simple ciphers, the most common elements of any given language are the ones you look for to repeat. In the case of the English language (which I presume the decoded message to be in! ) these most frequently repeated things include spaces between words and the commonly selected letters in Wheel Of Fortune: RSTLNE AOI. If we decided that 19 was our space character, then we would end up with words that look like the following:

[5,7,30] [17,6,30,14,20,4,3,2] [25,16]

This discourages me because there are not many phrases that end with a two-letter word, but let's check out the alternative where 30 is the space character:

[5,7] [19,17,6] [14,20,4,3,2,19,25,16]

The lack of repeating characters make me think that this code is not a simple cipher. If it were, then there would be letters that you could figure out (such as the two letter words containing either 25,16 or 5,7, for which there are only so many valid combinations the could apply) would appear in other words in order for you to have a clue as to that word. So, due to the lack of clues here, this would appear to be the wrong approach.



Smile; It confuses people.
Old Post 01-10-2004 03:37 AM
Click here to Send Sean Kelly a Private Message Find more posts by Sean Kelly Add Sean Kelly to your buddy list Reply w/Quote

chodder
INRH4xx0r^

offline
Registered: Jul 2003
Local time: 10:17 PM
Location: LV-426
Posts: 7051

post #7  quote:

what do you think the colors mean? is it part of the cyphering process or is it just design?


------------------------------------------------------------>Respectfully,|
------------------------------------------------------------>-Cho--------^
Old Post 01-10-2004 06:34 PM
Click here to Send chodder a Private Message Visit chodder's homepage! Find more posts by chodder Add chodder to your buddy list Click Here to Ignore chodder REPORT this Post to a ModeratorNOMINATE this Post for Reward Points Reply w/Quote

Larke2000
INReview Maven

offline
Registered: May 2003
Local time: 10:17 PM
Location:
Posts: 3859

post #8  quote:

i'm intrigued.


bring in the logic probe!

1100101101110011110000
Old Post 01-12-2004 08:58 AM
Click here to Send Larke2000 a Private Message Find more posts by Larke2000 Add Larke2000 to your buddy list Click Here to Ignore Larke2000 REPORT this Post to a ModeratorNOMINATE this Post for Reward Points Reply w/Quote

Staff
Sean Kelly
Free Thinker

offline
Registered: Jan 2003
Local time: 07:17 PM
Location: Silicon Valley
Posts: 4292

post #9  quote:

The colors are not a part of the code. That was my first inclination as well, but they are actually used to calibrate the camera in the martian atmosphere and lighting in order to perform color corrections on images received back at Earth. Most camera-bearing spacecraft contain some sort of mechanism for calibration such as this. As it was specifically mentioned on one of the mission pages, I assume that they have nothing to do with the message encryption.


Smile; It confuses people.
Old Post 01-13-2004 06:19 AM
Click here to Send Sean Kelly a Private Message Find more posts by Sean Kelly Add Sean Kelly to your buddy list Reply w/Quote

Larke2000
INReview Maven

offline
Registered: May 2003
Local time: 10:17 PM
Location:
Posts: 3859

post #10  quote:

quote:
Clue No. 1, January 10, 2004:

A longer text using the same code would read:

-llll-llll-----l-ll-lll-l-l-l-ll--l---lll-------ll---llll----l----ll-l-l-llll--l---lll-l-ll-ll-----llll-ll-l--lll-ll-l-llll--l---ll---lll--l----ll----ll--ll--l---l-l-lll----l-l-ll---------lll--l-l-llll-----lll-l-l-lll-ll-lll---lll--l-l-l-ll---ll-ll--llllll-l---ll---lll-ll----ll-l---l-l--l-llll--l.---ll---lll--l-----llll-llll------lll-lll-lllll-ll-ll-llll--ll---lll----l-ll--,---ll---lll--l-----l-ll-l-lll-l-l---ll----ll--llll---llll---l-llll----l-lllll----ll--l---lllllll,---ll----ll--ll--ll----l---ll---llll---ll--l--ll---ll--llll---lll--l-ll-------ll----ll--ll-l-ll-------ll--ll--lll-l-ll-ll-l---l-l--lll---lll-l-lll-l-l---ll-ll--lll-l-----l---l-l-l-l---lll-l--------lll-l-llll--ll-l--lll--ll--ll----l---ll-----ll---lll-l-l-llll-----l---ll-llllll---l-l-l-l---lll-l-------llll-llll--lll--lll-l-ll-llll-----llll.

Clue no. 2, January 12, 2004:

The text in clue No. 1 is from Carl Sagan's Pale Blue Dot.


hints off the site. i don't have a copy of 'pale blue dot' to peruse in an effort to at least match the punctuatuion.



bring in the logic probe!

1100101101110011110000
Old Post 01-13-2004 06:40 PM
Click here to Send Larke2000 a Private Message Find more posts by Larke2000 Add Larke2000 to your buddy list Click Here to Ignore Larke2000 REPORT this Post to a ModeratorNOMINATE this Post for Reward Points Reply w/Quote

Staff
Sean Kelly
Free Thinker

offline
Registered: Jan 2003
Local time: 07:17 PM
Location: Silicon Valley
Posts: 4292

post #11  quote:

FIRST CLUE

Very good..

quote:

-llll-llll-----l-ll-lll-l-l-l-ll--l---lll-------ll---llll----l----ll-l-l-llll--l---lll-l-ll-ll-----llll-ll-l--lll-ll-l-llll--l---ll---lll--l----ll----ll--ll--l---l-l-lll----l-l-ll---------lll--l-l-llll-----lll-l-l-lll-ll-lll---lll--l-l-l-ll---ll-ll--llllll-l---ll---lll-ll----ll-l---l-l--l-llll--l.---ll---lll--l-----llll-llll------lll-lll-lllll-ll-ll-llll--ll---lll----l-ll--,---ll---lll--l-----l-ll-l-lll-l-l---ll----ll--llll---llll---l-llll----l-lllll----ll--l---lllllll,---ll----ll--ll--ll----l---ll---llll---ll--l--ll---ll--llll---lll--l-ll-------ll----ll--ll-l-ll-------ll--ll--lll-l-ll-ll-l---l-l--lll---lll-l-lll-l-l---ll-ll--lll-l-----l---l-l-l-l---lll-l--------lll-l-llll--ll-l--lll--ll--ll----l---ll-----ll---lll-l-l-llll-----l---ll-llllll---l-l-l-l---lll-l-------llll-llll--lll--lll-l-ll-llll-----llll.


okay, so the clue above translated into numerics for easier reading is as follows:

quote:

01111011110000010110111010101011001000111000000011
00011110000100001101010111100100011101011011000001
11101101001110110101111001000110001110010000110000
11001100100010101110000101011000000000111001010111
10000011101010111011011100011100101010110001101100
11111101000110001110110000110100010100101111001. 00011000111001000001111011110000001110111011111011
0110111100110001110000101100,000110001110010000010
11010111010100011000011001111000111100010111100001
0111110000110010001111111,000110000110011001100001
00011000111100011001001100011001111000111001011000
00001100001100110101100000001100110011101011011010
00101001110001110101110101000110110011101000001000
10101010001110100000000111010111100110100111001100
11000010001100000110001110101011110000010001101111
11000101010100011101000000011110111100111001110101
101111000001111.


Now this confirms a few assumptions:

1) the code does indeed cipher/encrypt text characters
2) the SPACE character is included in this encryption, but not punctuation or line breaks
- thus we need 26 letters and a 27th character for SPACE (or some other "word separator")

Let's look at it a little more closely now before diving into Carl Sagan's text to look for a match.

First of all, the sample text is comprised of multiple sentences, and some sentences have multiple parts separated by commas. So we'll start by breaking the sample apart into sections separated by the punctuations since each section represents a complete string of characters run through the encryption algorithm:

SECTION 1: [0111101111000001011011101010101100100011100000001
10001111000010000110101011110010001110101101100000
11110110100111011010111100100011000111001000011000
01100110010001010111000010101100000000011100101011
11000001110101011101101110001110010101011000110110
011111101000110001110110000110100010100101111001]

SECTION 2: [0001100011100100000111101111000000111011101111101
10110111100110001110000101100]

SECTION 3: [0001100011100100000101101011101010001100001100111
10001111000101111000010111110000110010001111111]

SECTION 4: [0001100001100110011000010001100011110001100100110
00110011110001110010110000000110000110011010110000
00011001100111010110110100010100111000111010111010
10001101100111010000010001010101000111010000000011
10101111001101001110011001100001000110000011000111
01010111100000100011011111100010101010001110100000
0011110111100111001110101101111000001111]

Now the first big clue that I see when I look at these for strings is that sections 2, 3 and 4 all share a common opening data set of [00011000]. Sections 2 and three share even more than that, but all three share this. Why is this significant? Because what we know of english print states that the first character to appear following any punctuation will be a SPACE character. Section 1 does not start with this because the first sentence is not preceded by a SPACE. So I think we have found not only the code for our space character, but also our character bit size: 8 bits. Another important revelation is that there is no sort of complicated data compression scheme applied to the bit stream. If data compression were applied we would not see such repeatability in the data. So this is good news for us!

But still we have some mystery remaining because the original code contained 75 bytes of data and 75 is not evenly divisible by 8. Thus the bit size is not necessarily fixed for every character. So let's see what else we can spot. Let's locate every occurrance of our 8-bit space character and isolate it to see if we can spot any other repeatability between words since words are separated by spaces!

SECTION 1:
[011110111100000101101110101010110010001110000]
[00011000]
[1111000010000110101011110010001110101101100000111
101101001110110101111001]
[00011000]
[1110010]
[00011000] [0110011001000101011100001010110000000001110010101
11100000111010101110110111000111001010101100011011
0011111101]
[00011000]
[1110110000110100010100101111001].

SECTION 2:
[00011000]
[1110010000011110111100000011101110111110110110111
100110001110000101100],

SECTION 3:
[00011000]
[1110010000010110101110101]
[00011000]
[0110011110001111000101111000010111110000110010001
111111],

SECTION 4:
[00011000]
[0110011001100001]
[00011000]
[1111000110010011000110011110001110010110000]
[00011000]
[0110011010110000000110011001110101101101000101001
11000111010111010100011011001110100000100010101010
0011101000000001110101111001101001110011001100001]

[00011000]
[0011000111010101111000001000110111111000101010100
01110100000001111011110011100111010110111100000111
1].

Okay, lovely. So now it would appear that the text from Carl Sagan's writing will match a pattern:

WORD1 WORD2 WORD3 WORD4 WORD5. WORD6, WORD7 WORD8, WORD9 WORD10 WORD11 WORD12.

Now our next big clue comes from the word lengths themselves. WORD3 appears to be the shortest possible word, containing 7 bits. We all know that there are only two words in the english language that contain a single character and they are "A" and "I". This character may be either, but we know it must be one of the two. But before we get into that, lets look at some more bit lengths:

WORD1 = 45 bits - denominators: 1,3,5,9,15,45
WORD2 = 73 bits - denominators: 1,73
WORD3 = 7 bits - denominators: 1,7
WORD4 = 110 bits - denominators: 1,2,5,10,11,22,55,110
WORD5 = 31 bits - denominators: 1,31
WORD6 = 70 bits - denominators: 1,2,5,7,10,14,35,70
WORD7 = 25 bits - denominators: 1,5,25
WORD8 = 55 bits - denominators: 1,5,11,55
WORD9 = 16 bits - denominators: 1,2,4,8,16
WORD10 = 43 bits - denominators: 1,43
WORD11 = 149 bits - denominators: 1,149
WORD12 = 101 bits - denominators: 1,101

As you can see, there is no predictability in the denominators for each word length. This means that every character has a varying number of bits. So let's go back and look at our smallest character discovery.

I have armed myself with a text copy of Carl Sagan's "Pale Blue Dot". Carl, as you may or may not be aware, was a co-founder of the Planetary Society who is sponsoring this contest, before he passed away in 1996. Anyway, as I mentioned, the character must either be an "a" or an "I". So I performed a comprehensive search of the text for the string " I " - naturally this character string pops up frequently when you're discussing your own thoughts, but not nearly as much as the phrase " a ". As it turns out, there are only two sentence fragments in the book that contain five words where the middle word is "I", but as it turns out neither ends with a period punctuation. For this reason it is more likely that the character is an "a". So let's play around with "a" for a bit: we're going to see what other occurrences of the "a" bitstream are contained within the words:

[SPACE] = 00011000
A = 1110010

SECTION 1:
[011110111100000101101110101010110010001110000]
[ ]
[111100001000011010101 A 001110101101100000111101101001110110101111001]
[ ]
[ A ]
[ ]
[011001100100010101110000101011000000000 A 10111100000111010101110110111000 A 101011000110110011111101]
[ ]
[1110110000110100010100101111001].

SECTION 2:
[ ]
[ A 00001111011110000001110111011111011011011110011000
1110000101100],

SECTION 3:
[ ]
[ A 000010110101110101]
[ ]
[0110011110001111000101111000010111110000110010001
111111],

SECTION 4:
[ ]
[0110011001100001]
[ ]
[111100011001001100011001111000 A 110000]
[ ]
[0110011010110000000110011001110101101101000101001
11000111010111010100011011001110100000100010101010
0011101000000001110101111001101001110011001100001]

[ ]
[0011000111010101111000001000110111111000101010100
01110100000001111011110011100111010110111100000111
1].

Let me continue by saying I have NO IDEA if I'm on the right track here, I'm just following my nose. But Substituting all the "A" strings in the text has yielded a fair number of "A"s! Moreover, one of them is in such a position that it may help searching the text of Pale Blue Dot for ", A" (beginning of SECTION 3) where A begins a 3 or four letter word. The bitstream following the A in word 7 is 18 bits in length which means there are probably either two or three characters following assuming a 6 to 9 bit length for each remaining character. The word preceding the comma (WORD6) will also start with the letter "A" and be the first word in the sentence.

Strike out on that approach, no such sentence fragment exists. A further pisser is that the text does not contain a sentence matching the pattern "WORD6, WORD7 WORD8, WORD9 WORD10 WORD11 WORD12." There are six sentences beginning with ". WORD6, WORD7 WORD8, ", but none of them end correctly to match what we're looking for. So I guess we're back to square 1 (...?)



Smile; It confuses people.
Old Post 01-14-2004 12:05 AM
Click here to Send Sean Kelly a Private Message Find more posts by Sean Kelly Add Sean Kelly to your buddy list Reply w/Quote

chodder
INRH4xx0r^

offline
Registered: Jul 2003
Local time: 10:17 PM
Location: LV-426
Posts: 7051

post #12  quote:

have you been to the matrix site? they have this whole elaborate thing with encryptions and binary codes. check out the attachment below. i dont know if it can help you or not.

Attachment:
decoder.jpg (16.03 kb, 38 views)


------------------------------------------------------------>Respectfully,|
------------------------------------------------------------>-Cho--------^
Old Post 01-14-2004 01:31 AM
Click here to Send chodder a Private Message Visit chodder's homepage! Find more posts by chodder Add chodder to your buddy list Click Here to Ignore chodder REPORT this Post to a ModeratorNOMINATE this Post for Reward Points Reply w/Quote

Staff
Sean Kelly
Free Thinker

offline
Registered: Jan 2003
Local time: 07:17 PM
Location: Silicon Valley
Posts: 4292

post #13  quote:

That's a standard hexadecimal to binary conversion. Converts from a base-16 numeric system to a base-2. Useful for all types of computer engineering, but so far I haven't seen an indication that base-16 has any bearing on the code here.


Smile; It confuses people.
Old Post 01-14-2004 01:42 AM
Click here to Send Sean Kelly a Private Message Find more posts by Sean Kelly Add Sean Kelly to your buddy list Reply w/Quote

Staff
Sean Kelly
Free Thinker

offline
Registered: Jan 2003
Local time: 07:17 PM
Location: Silicon Valley
Posts: 4292

post #14  quote:

Starting over with the first two clues. What if the notion that the first character in SECTIONs 2-4 is a SPACE is correct, but that the space character is much smaller than the 8 bits identified and that those three sections just have more in common than simply the space character? I've come to the conclusions that the logic is sound with the SPACE character, but that the character itself has not quite been isolated correctly. This is because a search of the Pale Blue Spot text reveals that there is no sentence ending in "A ____ ____." nor "I ____ ____." This means that the first sentence is NOT necessarily 5 words and that the "A/I" word there doesn't quite make the sense that it seemed to at first.

(Just so you know I'm using software to process the e-text of Pale Blue Dot, not just a feeble, manual search. It has pretty comprehensive pattern-matching capabilities.)

So, in an effort to determine the exact nature of the SPACE character, let's try some smaller bit patterns and see how our words break apart, beginning with the original 4-section break-down declared above. First, 7 bits:

SPACE = [0001100]

SECTION 1: [011110111100000101101110101010110010001110000]
[ ]
[0111100001000011010101111001000111010110110000011
1101101001110110101111001]
[ ]
[01110010]
[ ]
[0011001100100010101110000101011000000000111001010
11110000011101010111011011100011100101010110001101
10011111101]
[ ]
[01110110000110100010100101111001]


We'll stop right there because SECTION 1 already displays the same pattern: 5 words with a single-character word in the middle which we have found not to exist in the source text. So let's try again with 6 bits:

SPACE = [000110]

SECTION 1:
[011110111100000101101110101010110010001110000]
[ ]
[001111000010]
[ ]
[1010111100100011101011011000001111011010011101101
01111001]
[ ]
[001110010]
[ ]
[ ]
[0110010001010111000010101100000000011100101011110
00001110101011101101110001110010101011]
[ ]
[110011111101]
[ ]
[001110110]
[ ]
[100010100101111001].

This certainly seems to have changed things up, but we ended up with a funny double-space in the middle there which would suggest that it is not a correct translation. Furthermore, we end up with a significant number of VERY short bit strings which are unlikely to make things any better for our word options. Still, they could be trying to get tricky, so let's analyze this a bit more before moving on..

1A = 45 bits
1B = 12 bits
1C = 57 bits
1D = 9 bits (similarity to 1G)
1E = 87 bits
1F = 12 bits
1G = 9 bits (similarity to 1D)
1H = 18 bits

Here's some good news: they all have bit lengths that are evenly divisible by three. You might recall that the code we ultimately want to break also has a bit length evenly divisible by three.

This is our best lead yet: 3 is a major player in deciphering the code.

So now we'll break down the remaining three sections according to the newly defined space character:

SPACE = [000110]

SECTION 1:
[011110111100000101101110101010110010001110000]
[ ]
[001111000010]
[ ]
[1010111100100011101011011000001111011010011101101
01111001]
[ ]
[001110010]
[ ]
[ ]
[0110010001010111000010101100000000011100101011110
00001110101011101101110001110010101011]
[ ]
[110011111101]
[ ]
[001110110]
[ ]
[100010100101111001].

SECTION 2:
[ ]
[0011100100000111101111000000111011101111101101101
11100110001110000101100]

SECTION 3:
[ ]
[001110010000010110101110101]
[ ]
[ ]
[011110001111000101111000010111110]
[ ]
[010001111111]

SECTION 4:
[ ]
[ ]
[011001100001]
[ ]
[001111]
[ ]
[010011]
[ ]
[011110001110010110000]
[ ]
[ ]
[011010110000]
[ ]
[011001110101101101000101001110001110101110101]
[ ]
[1100111010000010001010101000111010000000011101011
11001101001110011001100001]
[ ]
[0]
[ ]
[00111010101111000001]
[ ]
[1111110001010101000111010000000111101111001110011
10101101111000001111]

Our mysterious double-space has appeared a couple more times, and now there is a glaring anomaly that popped up in section 4 with a 1-bit length word. For now we'll consider that it might be a glitch in the data - let's look at everything else we have before we discard this approach.


SECTION 1 WORDS:
1A = 45 bits
1B = 12 bits
1C = 57 bits
1D = 9 bits (similarity to 1G)
1E = 87 bits
1F = 12 bits
1G = 9 bits (similarity to 1D)
1H = 18 bits

SECTION 2 WORDS:
2A = 72 bits

SECTION 3 WORDS:
3A = 27 bits
3B = 33 bits
3C = 12 bits

SECTION 4 WORDS:
4A = 12 bits
4B = 6 bits
4C = 6 bits
4D = 21 bits
4E = 12 bits
4F = 45 bits
4G = ??? Anomaly!
4H = 20 bits <- 1 missing bit to be a product of 3!
4I = 69 bits

EVERYTHING is miraculously a product of 3 except 4G and 4H, so I'm going to assume a mistake in the data and prepend 4G to 4H. The new section 4 looks like this:

SECTION 4:
[ ]
[ ]
[011001100001]
[ ]
[001111]
[ ]
[010011]
[ ]
[011110001110010110000]
[ ]
[ ]
[011010110000]
[ ]
[011001110101101101000101001110001110101110101]
[ ]
[1100111010000010001010101000111010000000011101011
11001101001110011001100001]
[ ]
[000111010101111000001]
[ ]
[1111110001010101000111010000000111101111001110011
10101101111000001111]

SECTION 4 WORDS:
4A = 12 bits
4B = 6 bits
4C = 6 bits
4D = 21 bits
4E = 12 bits
4F = 45 bits
4G = 21 bits
4H = 69 bits

MUCH BETTER! It's something of a cheat, but let's see how close we can get with it.

So now because everything is in nice, tidy groups of three, I'm tempted to convert everything into a base-8 (octal) notation which will be easier on the eyes for pattern recognition. I'm going to perform the conversion from left to right, 3 bits per octal digit where the right-most bit is the lest significant digit (LSD - compatible with typical engineering values and hence Microsoft's desktop calculator program). Because of the way we had that data error, and because of the peculiar double-spaces that are showing up, and now that we know everything is a nice, comfortable base-8, it is possible that the octal conversion is required BEFORE performing the pattern-match on the space character. This would eliminate funky matches. So I'm going to start with the original data, perform the base-8 conversion, then translate against our space character definition. Here's what the converted data set looks like:

SECTION 1: [3674055652621600617020653621655407551665710616206
06310534126001625701653556162530663750616606424571
]

SECTION 2: [06162036740356766674616054]

SECTION 3: [06162026565060636170570276062177]

SECTION 4: [0606314106170623063616260060632600631655505161656
50663501052435001657151631410603072570106770524350
03674716557017]

LOVELY! Now we apply word breaks based on what we think is the "space" character which should give is much cleaner results because we can no longer break any character-boundaries which we surely were before. The space character is 6 bits "000110" which translates in octal to "06". Because the space character is 6-bits, or two-characters in octal, it is likely that all other "letters" are two characters as well. We have to make sure not to split a letter up with our space replacement, so we'll separate each letter from the next before doing the replacement:

SECTION 1: [36 74 05 56 52 62 16 00 61 70 20 65 36 21 65 54 07 55 16 65 71 06 16 20 60 63 10 53 41 26 00 16 25 70 16 53 55 61 62 53 06 63 75 06 16 60 64 24 57 1].

SECTION 2: [06 16 20 36 74 03 56 76 66 74 61 60 54],

SECTION 3: [06 16 20 26 56 50 60 63 61 70 57 02 76 06 21 77],

SECTION 4: [06 06 31 41 06 17 06 23 06 36 16 26 00 60 63 26 00 63 16 55 50 51 61 65 65 06 63 50 10 52 43 50 01 65 71 51 63 14 10 60 30 72 57 01 06 77 05 24 35 00 36 74 71 65 57 01 7].

Strangely, we have a stray nibble at the end of sections 1 and 4. This could be an indicator that our pretzel logic may be leading us astray once again. But, as usual, we'll ignore that for now. The base-8 thing appears to be working for us, and if necessary, we'll abandon the idea that a word is always two nibbles (as would seem to be the case evidenced by the strays). So pretending that didn't happen, here's what goes down when we break our sentences apart with the word separator "06":

SECTION 1:
[36 74 05 56 52 62 16 00 61 70 20 65 36 21 65 54 07 55 16 65 71]
[06]
[16 20 60 63 10 53 41 26 00 16 25 70 16 53 55 61 62 53]
[06]
[63 75]
[06]
[16 60 64 24 57 1].

SECTION 2:
[06]
[16 20 36 74 03 56 76 66 74 61 60 54],

SECTION 3:
[06]
[16 20 26 56 50 60 63 61 70 57 02 76]
[06]
[21 77],

SECTION 4:
[06]
[06]
[31 41]
[06]
[17]
[06]
[23]
[06]
[36 16 26 00 60 63 26 00 63 16 55 50 51 61 65 65]
[06]
[63 50 10 52 43 50 01 65 71 51 63 14 10 60 30 72 57 01]
[06]
[77 05 24 35 00 36 74 71 65 57 01 7].

Combined with the strays, unless something VERY sneaky is going on in the encoding (like substitution of single-codes for complete common words), Section 4 now shows a significant development that decreases the likelihood that we're going down the right path. Not only has our double-space returned to haunt us, but there is a series of two single-letter words in a row separated by spaces which couldn't possibly make sense in the english language. How could " a I " or " I a " ever make sense in any meaningful context? Furthermore if either 17 or 23 were characters "a" or "i" two of only five vowels in our written language, you would see them appear repeatedly elsewhere in the message, yet neither appears ever again!

SO - we're onto something with the base-8 stuff, but there's still something wrong with our definition for the word separator, and/or the way we're applying it, and/or our assumptions about how many 3-bit nibbles make up an actual word.



Smile; It confuses people.
Old Post 01-14-2004 03:38 AM
Click here to Send Sean Kelly a Private Message Find more posts by Sean Kelly Add Sean Kelly to your buddy list Reply w/Quote

chodder
INRH4xx0r^

offline
Registered: Jul 2003
Local time: 10:17 PM
Location: LV-426
Posts: 7051

post #15  quote:

why do i have this strange feeling that leaving out those pieces of nibble will screw it up. it has to work, why would they just have it so there is extra pieces left... unless they are trying to decieve us.


------------------------------------------------------------>Respectfully,|
------------------------------------------------------------>-Cho--------^
Old Post 01-14-2004 04:14 AM
Click here to Send chodder a Private Message Visit chodder's homepage! Find more posts by chodder Add chodder to your buddy list Click Here to Ignore chodder REPORT this Post to a ModeratorNOMINATE this Post for Reward Points Reply w/Quote
Time: 03:17 AM Post New Thread   
Pages (3):  [1] 23 »   Print Version | Email Page | Bookmark | Subscribe to Thread
INReview INReview > Archives > Science > Mars > Help crack the code!
Search this Thread:
Forum Rules:
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts
HTML code is OFF
vB code is ON
Smilies are ON
[IMG] code is ON
Forum Policies Explained
 
Rate This Thread:

< - INReview.com >

Copyright ©2000 - 2007, Jelsoft Enterprises Limited
Page generated in 0.63759589 seconds (85.96% PHP - 14.04% MySQL) with 36 queries.

ADVERTISEMENTS
Support This Site! Shop @ INReview!


© 2007, INReview.com.   Popular Forums  My Favorites All Forums   Web Hosting and Web Design by Psyphire.
INReview.com: Back to Home