Pimm – Partial immortalization

A Biotech Geek (micro)Blogger’s adventures through science, technology and the web…

  • email me

    [attilacsordas][at][gmail.com]
  • Attila on Twitter

    • Red Mars before sleep &after JavaScript:dropping windmills=>spin=>heat in coils=>release to atmosphere, winds slowing down=>dust storms down 14 hours ago
    • Hard to believe, learn in what sense? See/trial & error? RT @GreatDismal I learn more watching people use new tech than using it myself 16 hours ago
    • nephews (11,13) just learned how to run, modify & debug the 'Hello World' JavaScript on the iPhone w/ Notes, variables & functions next ;) 21 hours ago
    • Family party this afternoon: preparing w/little JavaScript snippets on the iPhone for my nephews so they can run scripts on their iPod touch 1 day ago
    • Safari is losing http requests to Chrome/Firefox on my laptop due to the lack of an omnibox capability 1 day ago
  • Recent Comments

    GB on Visualize 23andMe haplogroup d…
    MaryHollmy on Google Health, IBM: real-time,…
    colon hydrotherapy l… on Why the Dyna-Vision G1 Android…
    revathi on Human mitochondrial DNA vs. nu…
    Erik Cole on Michael Rose, evolutionary SEN…
    drugrehabusa on Stem Cell Therapy Market, US, …
    Letago on Can you tell a good article fr…
    Online Offers on Life extension people are happ…
    เสื้อผ้า on How to read PDF files on iPhon…
    atsoft on Add stem cells and eat the lab…
  • licence

    Creative Commons License
  • c

  •  

    March 2008
    M T W T F S S
    « Feb   Apr »
     12
    3456789
    10111213141516
    17181920212223
    24252627282930
    31  

Archive for March 3rd, 2008

How much data is produced by a life scientist/day?

Posted by attilachordash on March 3, 2008

3TBThe current operational idea behind Google’s Palimpsest Project is to ship 3TB (terrabyte= 1.0995 x 1012 bytes) drive array (Linux RAID-5) for scientists, who upload their data and FedEx the hard drives back to Google. Google then make those data publicly available and manageable. This file transfer method was heavily criticized by Dai Davies in Ars Technica. “This is a bit like using Flintstones technology in the Internet era.” although there are arguments behind this choice, see Jon Trowbridge’s 11th slide. Forget about this uploading/updating problem to the amount of this post. Here I only care about the end-user, the scientist who is provided with whatever tool to upload 3TB of research, measurement data on behalf of her research facility. While for an astronomer hundreds of gigabytes/day can seem as a normal output my angle is on how a life scientist and his data fits to this 3TB equation and eventually to the Palimpsest Project. Accordingly, my question is this:

How much data is produced by an average wet lab scientist, biomedical researcher/day?

I try to come out with a rough guess in the hope of subtle corrections from the commenters: I assume the following (rather busy) daily production of data by our average scientist in an average lab:

running a gel – making a gel photo 300 KB .tiff

preparing 5 samples for sequencing at the core facility, output: 500 KB – 1MB ab1, seq files

FACS sorting of different cell populations: 1 MB of special FACS files and 100 KB pdf out of it

Read the rest of this entry »

Posted in bioinformatics, biology, biotechnology, data, science, technology | 2 Comments »