Google’s Palimpsest project: promiscuous distribution of all science data sets

September 25, 2007 | October 15, 2008 | Posted in data, google, googleplex, IT, Sci Foo, science, science slideshows, SciFoo, technology, USA

Google’s Palimpsest project, once realized (in the near future), has the potential to change the way science is done by accepting gigantic (raw?) data sets from all disciplines and making them open and free (including dark data?). Jon Trowbridge from Google Inc. gave a presentation at SciFoo 2007 at the Googleplex that was not well documented, but you can download his slides on the project as presented at XTech 2007 in Paris this May: Making Massive Datasets Universally Accessible and Useful Presentation. You are not restricted to the zip file, as Jon kindly gave permission to publish his slides on SlideShare here. From his intro: This talk will discuss a project underway at Google to collect and distribute large scientific datasets using a 21st century “Sneakernet”: multi-terabyte disk arrays shipped via FedEx and other common carriers.
The project is strictly non-profit, but fits well with Google’s mission.

Published by attilacsordas

AgeCurve founder using proteomes to demystify aging and add more healthy years, ex mito-stem cell biologist, bioinformatician, Open Lifespan philosopher. View all posts by attilacsordas

43 thoughts on “Google’s Palimpsest project: promiscuous distribution of all science data sets”

Deepak says:

September 25, 2007 at 3:55 pm

Attila

This is great. Thanks for posting.
Pingback: » Google and promiscuous distribution of data » business|bytes|genes|molecules
Pedro Beltrao says:

September 27, 2007 at 4:15 am

Thanks for posting the slides. It is interesting. This is still very much for very large data volumes, but maybe whatever they build around this (maybe a GBase segment for scientific data) could be used for lower data volumes uploaded via the net.
Deepak says:

September 27, 2007 at 4:14 pm

Pedro,

At least for now they don’t necessarily have plans to do much with the data other than make it available on the web. Ideally, I think a Freebase/GBase type approach would be great. With an appropriate API and knowledge of the data structure, people could start building apps and, of course, Google would do a great job of indexing the whole thing.
Pingback: Dark Data « Patient Centric Healthcare
Pingback: Tech News » Blog Archive » Google to Host Terabytes of Open-Source Science Data
Pingback: Digging Digitally » More on Google: Free Hosting of Open Science Data
Pingback: Google, gapminder and scientific data sets : business|bytes|genes|molecules
Pingback: Google Said To Be Prepping Launch Of Social Science Data Network
Pingback: Steffen Prohaska » Blog Archive » Storing Scientific Data at Google?
Pingback: Google to host Open Source scientific data sets at The Musings of Chris Samuel
Pingback: Simone Cortesi » Blog Archive » Google Research
Pingback: The Fluff and the Mediocrity » Science News Round-Up #2
Pingback: Il blog dei Marsiaj - » Google ospitarà terabytes di dati scientifici open-source ?
Pingback: Google Decides to Host a Whole Lot of Scientific Data - Palimpsest Project
Pingback: Google lanzará una plataforma de servicios para cientificos - Blog de Dr. Max Glaser
Pingback: Communications
Laser says:

January 21, 2008 at 6:10 pm

That’s great.
I am curious what browse/search features Google will provide. It would be nice if the data were well annotated using semantic web technology.
Pingback: Google schenkt Forschern Speicherplatz « Thalex
Lee Watkins says:

January 22, 2008 at 11:43 pm

We at the Ctr. for Inherited Disease Research routinely ship data from genome scans to PIs and back and forth to NLM/NCBI on large encrypted disk arrays. We also continually archive and will eventually have to delete all the level 0 or “raw” data – the actual image data from which the genomic data is derived. I think someday we will regret deleting this data, since better algorithms are developed every day, yet many of these studies use the very last of the available DNA from a given research subject, who may be dead or otherwise no longer available for further extraction. Having someplace to store them for future re-analysis would, imho, be a great service.

Now, what about the data from extremely high-res 3D scans of the world’s entire collection of several hundred thousand cuneiform tablets, the world’s oldest written records and in many ways the foundational documents of human civilization? It might be a few petabytes or so: http://www.jhu.edu/digitalhammurabi/
Pingback: Rumors suggest Google is set to open scientific data store « The “Meta” Internet: The genesis of a “virtual” Silicon Valleys leveraging the power of the Internet.
Pingback: Google向科学家提供免费数据储存 | 七平米
Pingback: Google Shutters Its Science Data Service — instantwebmeetings.com - Video Collaboration, E Learning, Video Meeting, Unified Communications
Pingback: Open Source Science : Scriptyx.com
Alex says:

March 23, 2010 at 11:36 am

Hey guys
what happened to your project? Did you cancel it? Or does it run under another name? This was a really cool idea.
Joshua Shriver says:

December 1, 2010 at 11:54 pm

This is interesting, but sadly 2 years old and I am still unable to find solid information on the project. Guessing it was abandoned. If not feel free to contact me, I’ll keep digging and emailing since I have a 1.2 TB dataset that a lot of people would like to see hosted.
Peter says:

June 27, 2011 at 9:46 pm

Google has so much control it is scary…..but we all need it.
Kewell says:

September 1, 2011 at 3:34 am

Great data, Thank you very much for posting.
Sarah Reynolds says:

October 11, 2011 at 1:21 pm

very interesting information, thanks for taking the time to put it together for us. Would definitely love to hear more from you guys!!
Preston Bacca says:

November 2, 2011 at 3:45 pm

Greetings! This is my first visit to your blog! We are a group of volunteers and starting a new project in a community in the same niche. Your blog provided us valuable information to work on. You have done a extraordinary job!


Deep, healthy lifespan extension

On the basics and contexts of extreme longevity