In my last post I described how I stumbled across (all on my ownsome) a way to access the police files on Schoenberg's death. Some people (hi Zoe!) have had problems accessing the images so I've uploaded them to flickr here in no particular order.
In doing so three images in particular caught my eye, partly becuase they're notes written in the same type of moleskine pad that I have (the squared one - love it) and partly because I spotted some familiar bits and pieces.
On one there is one (or a couple of?) DNA sequences

The sequences as two strings are:
TCGACGGGGCGTTATAATGAAGAGGATTCTTTGGAAGGCGCGCTATGTCTTGAGGCTAGAGCGAACTGTGAGTGA
CCTCGTATCGAATCCGGGCTTATATGTATCACCGGGARRTTGGACAACTGA
The first thing I'm tempted to do with a DNA sequence is to see what it is - I BLASTed it (a biological search algorithm) against the public sequence databases at the NCBI and got nothing. The second thing that I was tempted to do was to translate the DNA sequence into a protein sequence, each three letters in the DNA sequence represent one amino acid and there is a letter code used to represent each amino acid. You can do this for yourself here at ExPASy, copy and paste the sequence above and hit translate. You get six protein sequences back because with DNA sequences you can't always be sure whether the codons start at the first, second or third letter of the sequence and you don't know whether you should be reading forwards or backwards. For a table to do the same thing manually check out wikipedia here.
Anyway - this one's easy, it's forwards and the first DNA letter (I'm going to start calling them bases because it feels patronising calling them letters) is the start of the codon:
S T G R Y N E E D S L E G A L C L E A R A N C E Stop P R I E S G L I C I T G X L D N Stop
Certain things leap out: Stgry Needs Legal Clearance - Stingray needs legal clearance perhaps (because I like supermarionation); the two sentences are both ended by stop codons (TGA - oh yes, your DNA sequence even has punctuation) which is nice; and the second sentence makes less sense.
I thought that it might be in a different frame (the codons starting on the second or third base) but when you look at the other possible translations they make less sense (no nice neat stop codon at the end). I'm also intrigued by the X. X doesn't represent a particular amino acid and it's in there because ExPASy couldn't translate the codon ARR, which is fair enough because R isn't in DNA, it's an IUPAC ambiguity code - a fancy arsed way of saying "we're not sure whether this is an A or a G, but it's definitely one or the other". So ARR could be AAG, AGA, AAA or AGG, tranlsating that to protein there are only two options, L or K. Neither make any more sense. LDN at the end suggests London to me, but in what context? There are 323 anagrams of P R I E S G L I C I T G X L D N, none of which leap at me. I have a feeling there is more decrypting to be done to that part of the message, but I haven't got it yet. Tish.
Still, it's a start and Markus is egging us on - one of the notes on another page is "*idea* messages within life". The same page also references Craig Venter (coincidentally, I've blogged about him before) - I have a feeling that he's mentioned in order to reference his work on synthetic biology, building organisms (specifically bacteria in the first instance) from scratch for particular purposes, such as fuel production.
8 comments:
This is amazing! Makes me realise how much I still need to learn. Thanks so much for all your hard work. Rachcon1 x
Mike,
I've been looking through some of the stuff in Markus' notebook and may have a lead as to what his illness was.
Firstly, in 1314-libreta3 Markus mentions PTU twice. I googled PTU and it's a drug used to treat Graves Disease:
http://en.wikipedia.org/wiki/Graves_disease
Could the $5000 paid have been towards research?
Secondly, in 1314-libreta2, the name RM Santos comes up. A quick search sent me to Ophthalmic Genetics:
http://www.informaworld.com/smpp/title~content=g785731538~db=all
Problems with the eyes are key symptoms of Graves Disease.
Finally, in 1314-libreta2
, the number 1789023 is written down.
Searching this took me to the NBCI - It's a DNA Binding protein:
http://www.ncbi.nlm.nih.gov/sites/entrez?dopt=GenPept&cmd=Retrieve&db=protein&list_uids=1789023
I'm not sure how this Protein connects to Markus - perhaps he was testing it for a cure to the disease?
I'm probably barking up the wrong tree, but to be honest we need to search through every scrap of evidence we have right now.
Hope you're okay.
xo
What would your interpretation of the "ARR" towards the end of the code? It doesn't code for anything does it? It's not even written in bases (unlike the rest of the message) it seems a bit strange that someone who seemed to know so much about genetics would put it in there by accident but I just can't work it out.
http://acronyms.thefreedictionary.com/ARR
All are popular acronyms for ARR - none jump out at me though.
Okay.
I'm really freaked out.
A quick Google search of LDN brought up Low Dose Naltrexone - a drug which may be able to treat autoimmune diseases such as Graves' Disease.
http://www.lowdosenaltrexone.org/
Anonymous: The ARR is odd, and it strikes me as something that is there because of the fact it's a message in code rather than because it's a DNA sequence. It's not necessarily out of place in a DNA sequence, but then it's equivocal - so why is it there?
Zoe: Cool - you've been busy! Given the hint for the first section in the notebooks I'm sure that something else in there will unlock the next section. Still at work so sorry for being unresponsive. Will probably have time to get into it more later :)
Zoe, I'm not sure about the link to Graves' Disease:
- PTU is mentioned, but it looks like it's a person, not a drug: "Call PTU...Confirm W. PTU"
- Graves disease causes eye inflammation, whereas R M Santos-Anderson seems to work on retinal problems.
- LDN can also be an acronym for any of these things: http://acronyms.thefreedictionary.com/LDN Autoimmune diseases is a fairly broad category, and Graves disease isn't specifically mentioned.
- That protein is a bacterial one, not a human one. Although the Wikipedia article does say that bacteria may trigger the disease, I think typing many random numbers into NCBI will bring up something. It's also in a note about DEPAA, not about DNA.
I've also tried Caesar shift ciphers on the second 'sentence' of the DNA code, but nothing jumps out.
It may be co-incidence, but if you take the complementary sequence to the second strand, and translate it, then one of the reverse reads (which would be expected, since the strands lie in opposite directions) also starts with the amino acids STG. In full:
STGXRATMYIRA-AML
Can't make head nor tail of it, but the chances of the first three amino acids being the same by chance are ~1 in 8000 (bit less, since we've tried multiple approaches). The letters "STGRAY" are also amongst them.
Post a Comment