This is a static archive of the previous Open Grid Forum GridForge content management system saved from host forge.ogf.org file /sf/wiki/do/viewPage/projects.pgi-wg/wiki/ReqNF14 at Fri, 04 Nov 2022 17:47:42 GMT SourceForge : View Wiki Page: ReqNF14

Project Home

Tracker

Documents

Tasks

Source Code

Discussions

File Releases

Wiki

Project Admin
Search Wiki Pages Project: pgi-wg     Wiki > ReqNF14 > View Wiki Page
wiki2259: ReqNF14

Req. Nb ID Description Source Areas Dependencies Status Date
NF14 155 Data is equally important as compute NSF 08-571     Open (unclear) 2010-04-29

Andrew Grimshaw on 2010-03-20

  • Propose requirement with title  'Data is equally important as compute'

Etienne Urbah's position on 2010-04-07

  • In fact, Computer science has progressively moved from old days 'Number crunching' to current 'Information processing' with more and more emphasis on Data management
  • Just an example :
    - At the beginning of hard disks, data was accessed directly by applications on disk tracks
    - Then Files encapsulating several (non-contiguous) sectors of disk tracks were invented
    - Then Names were given to files, and Catalogs were created to consistently store and reference file names with their sector allocations
    - Also Databases where invented to manage large amounts of data having a known structure, and manage simultaneous access to chunks of data by different users
    - ...
  • My professional experience proves indeed that 'Data is MORE important than compute' :
    That is essential as soon as we have to consider Access rights management, Transactional applications, Complex workflows, ... which are increasingly required by Scientific communities
  • In particular, the success of EGEE does NOT primarily come from its gLite middleware, but from its design as 'Distributed Data Storage and Sharing'
    - permitting access to this distributed data from any gLite client, and
    - permitting, but NOT requiring, submission of remote computing activities (jobs), which MAY use this distributed and shared data
  • For clarity, I suggest to change the original title to
    'Data is more important than compute.  The Execution Service MUST NOT try to perform full Data management, but MUST focus only on the management of Activities, which MUST rely on dedicated Data services for Data management'
  • Anyway, I vote FOR this requirement

Morris, Balazs and Etienne on 2010-04-27

  • Spreadsheet ID = 155

Amsterdam meeting on 2010-04-29

  • Open  (still unclear)
 



Versions Associations Attachments Back Links  
Version Version Comment Created By
Version 3 Still unclear Etienne URBAH - 05/05/2010
Version 2 ! Morris, Balazs and Etienne on 2010-04-27 * Spreadsheet ID = 155 ! Amsterdam meeting on 2010-04-29 * Open  (clear, but NO agreement found yet) Etienne URBAH - 05/04/2010
Version 1 Etienne URBAH - 04/07/2010



The Open Grid Forum Contact Webmaster | Report a problem | GridForge Help
This is a static archive of the previous Open Grid Forum GridForge content management system saved from host forge.ogf.org file /sf/wiki/do/viewPage/projects.pgi-wg/wiki/ReqNF14 at Fri, 04 Nov 2022 17:47:49 GMT