Summarizing the attempt to trace pieces of data in the Large Hadron Collider Grid:
While it’s true that files are given dates and times, figuring out where a packet of data came from is a lot more difficult than just reading the file name. Data are copied, split, copied again and deleted. Bits and pieces travel all over the Grid, often without any input from a human user.
Sounds in a way similar to what I’ve described as an ideal big data processing engine.
Original title and link: Big Data Flow in Large Hadron Collider Grid (NoSQL databases © myNoSQL)