<<O>>  Difference Topic WebHome (r1.48 - 16 May 2008 - LucMoreau)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 11 to 11

The second provenance challenge resulted in discussions where a consensus about a common data model began to emerge. This consensus, summarised at SecondWorkshopMinutes, has led to a proposed specification of a provenance data model and inference rules, the Open Provenance Model: OPM. A review period of this model is commencing in January 2008, with hope to agree on a data model and evaluate it a Third Provenance Challenge. Comments about the feedback can be found at OpenProvenanceModelReview.

Added:
>
>
The OpenProvenanceModelWorkshop will take place on Thursday 19th, just after IPAW, at the University of Utah, Salt Lake City.

Mailing list

A mailing list for challenge related issues has been set up. An archive is available from http://www.ipaw.info/mail/archive.php/. In order to subscribe please send

 <<O>>  Difference Topic WebHome (r1.47 - 18 Dec 2007 - LucMoreau)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance standardization at the International Provenance and Annotation Workshop (IPAW'06, www.ipaw.info), the community decided that it needs to understand the different representations used for provenance, its common aspects, and the reasons for its differences. As a result, the community agreed that a "Provenance Challenge" should be set to compare and understand existing approaches.

Changed:
<
<
The first challenge (details here and teams' results available from here) commenced 2006-June-19 and concluded in a workshop held on 2006-September-13 in Washington, DC. There was a total of 17 teams, contributing a diverse range of results. As part of the discussion at that workshop, it was decided to hold a second challenge, for which the focus would be interoperability between systems.
>
>
The First Provenance Challenge (details here and teams' results available from here) commenced 2006-June-19 and concluded in a workshop held on 2006-September-13 in Washington, DC. There was a total of 17 teams, contributing a diverse range of results. As part of the discussion at that workshop, it was decided to hold a second challenge, for which the focus would be interoperability between systems.

Changed:
<
<
The second provenance challenge commenced on 2006-December-12 and concluded on 2007-June-26 with a day-long workshop at High Performance Distributed Computing in Monterey, California, where teams presented and discussed the results. Please see the specification of the challenge, the agenda (including teams' presentations) and a summary of the technical points raised in discussion. The result data from all teams are available from the team pages.
>
>
The second provenance challenge commenced on 2006-December-12 and concluded on 2007-June-26 with a day-long workshop at High Performance Distributed Computing in Monterey, California, where teams presented and discussed the results. Please see the specification of the challenge, the agenda (including teams' presentations) and a summary of the technical points raised in discussion. The result data from all teams are available from the team pages.

The second provenance challenge resulted in discussions where a consensus about a common data model began to emerge. This consensus, summarised at SecondWorkshopMinutes, has led to a proposed specification of a provenance data model and inference rules, the Open Provenance Model: OPM. A review period of this model is commencing in January 2008, with hope to agree on a data model and evaluate it a Third Provenance Challenge. Comments about the feedback can be found at OpenProvenanceModelReview.


Mailing list

 <<O>>  Difference Topic WebHome (r1.46 - 30 Jul 2007 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 6 to 6

The first challenge (details here and teams' results available from here) commenced 2006-June-19 and concluded in a workshop held on 2006-September-13 in Washington, DC. There was a total of 17 teams, contributing a diverse range of results. As part of the discussion at that workshop, it was decided to hold a second challenge, for which the focus would be interoperability between systems.

Changed:
<
<

Second Provenance Challenge

The second provenance challenge commenced on 2006-December-12 and will conclude in June with a day-long workshop, where teams will assess the results. Please see this page for full details on the challenge. The deadline for uploading the first phase results is 2007-February-20 and the final results by the start of the workshop on 2007-June-26. If you are taking part, please subscribe to the mailing list (details below), as this is where we will announce details of the workshop and any clarifications to the challenge itself.

There is no need to have taken part in the first challenge to take part in the second: anyone who wants to is welcome to participate!

Contributions by Participating Teams

Each participating team will upload their material on the twiki, adopting the following template SecondChallengeTemplate. Each team should make a copy of the template and add their entry to the ParticipatingTeams page.

If you have not already done so for the first challenge, in order to create a TWiki page on which you can upload your results, you each have to register with the site at the URL below. It would be helpful if you could register sooner rather than later, even if you don't create a page or edit anything, so that we know who is taking part.

http://twiki.gridprovenance.org/bin/view/TWiki/TWikiRegistration

Provenance Challenge Workshop

A meeting to discuss the results of the second provenance challenge will be held as a workshop at the next High Performance Distributed Computing conference on the 25th and/or 26th June.

>
>
The second provenance challenge commenced on 2006-December-12 and concluded on 2007-June-26 with a day-long workshop at High Performance Distributed Computing in Monterey, California, where teams presented and discussed the results. Please see the specification of the challenge, the agenda (including teams' presentations) and a summary of the technical points raised in discussion. The result data from all teams are available from the team pages.

Mailing list

 <<O>>  Difference Topic WebHome (r1.45 - 29 Apr 2007 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 8 to 8

Second Provenance Challenge

Changed:
<
<
The second provenance challenge commenced on 2006-December-12 and will conclude in June with a day-long workshop, where teams will assess the results. Please see this page for full details on the challenge. The deadline for uploading the first phase results is 2007-February-20 and the final results by the start of the workshop on 2007-June-25 (or possibly 2007-June-26, to be confirmed). If you are taking part, please subscribe to the mailing list (details below), as this is where we will announce details of the workshop and any clarifications to the challenge itself.
>
>
The second provenance challenge commenced on 2006-December-12 and will conclude in June with a day-long workshop, where teams will assess the results. Please see this page for full details on the challenge. The deadline for uploading the first phase results is 2007-February-20 and the final results by the start of the workshop on 2007-June-26. If you are taking part, please subscribe to the mailing list (details below), as this is where we will announce details of the workshop and any clarifications to the challenge itself.

There is no need to have taken part in the first challenge to take part in the second: anyone who wants to is welcome to participate!

 <<O>>  Difference Topic WebHome (r1.44 - 03 Jan 2007 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 8 to 8

Second Provenance Challenge

Changed:
<
<
The second provenance challenge commenced on 2006-December-12 and will conclude in June with a day-long workshop, where teams will assess the results. Please see this page for full details on the challenge. The deadline for uploading the first phase results is 2007-January-31 and the final results by the start of the workshop on 2007-June-25 (or possibly 2007-June-26, to be confirmed). If you are taking part, please subscribe to the mailing list (details below), as this is where we will announce details of the workshop and any clarifications to the challenge itself.
>
>
The second provenance challenge commenced on 2006-December-12 and will conclude in June with a day-long workshop, where teams will assess the results. Please see this page for full details on the challenge. The deadline for uploading the first phase results is 2007-February-20 and the final results by the start of the workshop on 2007-June-25 (or possibly 2007-June-26, to be confirmed). If you are taking part, please subscribe to the mailing list (details below), as this is where we will announce details of the workshop and any clarifications to the challenge itself.

There is no need to have taken part in the first challenge to take part in the second: anyone who wants to is welcome to participate!

 <<O>>  Difference Topic WebHome (r1.43 - 11 Dec 2006 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance standardization at the International Provenance and Annotation Workshop (IPAW'06, www.ipaw.info), the community decided that it needs to understand the different representations used for provenance, its common aspects, and the reasons for its differences. As a result, the community agreed that a "Provenance Challenge" should be set to compare and understand existing approaches.

Changed:
<
<

First Provenance Challenge

>
>
The first challenge (details here and teams' results available from here) commenced 2006-June-19 and concluded in a workshop held on 2006-September-13 in Washington, DC. There was a total of 17 teams, contributing a diverse range of results. As part of the discussion at that workshop, it was decided to hold a second challenge, for which the focus would be interoperability between systems.

Changed:
<
<
The first provenance challenge commenced on 2006-June-19 and will conclude in September with a day-long workshop, where teams compare their approaches. Please see first provenance challenge for full details on the challenge. The deadline for uploading results is the start of the workshop, 13th September 2006.

Each participating team will upload their material on the twiki, adopting the following templage ChallengeTemplate. In order to create a TWiki page on which you can upload your results, you each have to register with the site at the URL below. It would be helpful if you could register sooner rather than later, even if you don't create a page or edit anything, so that we know who is taking part.

http://twiki.gridprovenance.org/bin/view/TWiki/TWikiRegistration

Provenance Challenge Workshop

>
>

Second Provenance Challenge


Changed:
<
<
A workshop to discuss the results of the first provenance challenge will be held at the next Global Grid Forum (GGF-18), to be held in Washington, September 11 to 14, 2006. The provenance challenge workshop itself will take place on the 13th and 14th September. Detailed schedule.
>
>
The second provenance challenge commenced on 2006-December-12 and will conclude in June with a day-long workshop, where teams will assess the results. Please see this page for full details on the challenge. The deadline for uploading the first phase results is 2007-January-31 and the final results by the start of the workshop on 2007-June-25 (or possibly 2007-June-26, to be confirmed). If you are taking part, please subscribe to the mailing list (details below), as this is where we will announce details of the workshop and any clarifications to the challenge itself.

Changed:
<
<
The workshop venue is room 147B of the conference facility (Washington Convention Center). There are details about getting to the convention center at here.
>
>
There is no need to have taken part in the first challenge to take part in the second: anyone who wants to is welcome to participate!

Contributions by Participating Teams

Changed:
<
<
Each team should make a copy of the ChallengeTemplate and produce their entry on the ParticipatingTeams page.
>
>
Each participating team will upload their material on the twiki, adopting the following template SecondChallengeTemplate. Each team should make a copy of the template and add their entry to the ParticipatingTeams page.

Changed:
<
<
<!--
  • REDUX, Database Research Group, MSR
  • MINDSWAP, Semantic Web Research Group, University of Maryland, College Park
  • Karma, Computer Science Department, Indiana University
  • CESNET, GRID research group, CESNET z.s.p.o. Prague, Czech Republic
  • myGrid, University of Manchester
  • VisTrails, University of Utah
  • Gridprovenance, Cardiff University
  • ES3, University of California, Santa Barbara
  • UPenn, University of Pennsylvania, Database Group
  • RWS, UC Davis and SDSC, California
  • DAKS, Genome Center, UC Davis, California
  • PASS, Harvard
  • SDG, Pacific Northwest National Lab
  • NcsaD2k and NcsaCi, National Center for Supercomputing Applications
  • UChicago, University of Chicago Computation Institute
  • Southampton, University of Southampton, PASOA and Provenance projects
  • USC/ISI, University Of Southern California/Information Sciences Institute
  • And more ...
  • ...
>
>
If you have not already done so for the first challenge, in order to create a TWiki page on which you can upload your results, you each have to register with the site at the URL below. It would be helpful if you could register sooner rather than later, even if you don't create a page or edit anything, so that we know who is taking part.

Changed:
<
<
-->

Provenance Queries Matrix

>
>
http://twiki.gridprovenance.org/bin/view/TWiki/TWikiRegistration

Changed:
<
<
The ProvenanceQueriesMatrix is a summary of how queries are supported by the different systems.
>
>

Provenance Challenge Workshop

A meeting to discuss the results of the second provenance challenge will be held as a workshop at the next High Performance Distributed Computing conference on the 25th and/or 26th June.


Added:
>
>

Mailing list


Deleted:
<
<

Mailing list


A mailing list for challenge related issues has been set up. An archive is available from http://www.ipaw.info/mail/archive.php/. In order to subscribe please send

Line: 59 to 34

Once subscribed messages can be sent to provenance-challenge@ipaw.info .
Deleted:
<
<

After the First Provenance Challenge

Participants please add to both of these!

Second Provenance Challenge

  • Details on collaborations between teams to show interoperability?


 <<O>>  Difference Topic WebHome (r1.42 - 06 Oct 2006 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 65 to 65

Deleted:
<
<

Second Provenance Challenge

 <<O>>  Difference Topic WebHome (r1.41 - 06 Oct 2006 - SteveMunroe)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 65 to 65

Added:
>
>

Second Provenance Challenge

 <<O>>  Difference Topic WebHome (r1.40 - 14 Sep 2006 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 60 to 59

Once subscribed messages can be sent to provenance-challenge@ipaw.info .
Added:
>
>

After the First Provenance Challenge

Participants please add to both of these!

Second Provenance Challenge

  • Details on collaborations between teams to show interoperability?


For information, other challenges have been previously set in other areas of computer science.

 <<O>>  Difference Topic WebHome (r1.39 - 13 Sep 2006 - LucMoreau)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 22 to 22

Contributions by Participating Teams

Changed:
<
<
Each team should make a copy of the ChallengeTemplate and produce their entry below.
>
>
Each team should make a copy of the ChallengeTemplate and produce their entry on the ParticipatingTeams page.

Added:
>
>
<!--

  • REDUX, Database Research Group, MSR
  • MINDSWAP, Semantic Web Research Group, University of Maryland, College Park
  • Karma, Computer Science Department, Indiana University
Line: 44 to 45

  • And more ...
  • ...
Changed:
<
<
>
>
-->

Provenance Queries Matrix

The ProvenanceQueriesMatrix is a summary of how queries are supported by the different systems.

 <<O>>  Difference Topic WebHome (r1.38 - 12 Sep 2006 - ShawnBowers)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 34 to 34

  • ES3, University of California, Santa Barbara
  • UPenn, University of Pennsylvania, Database Group
  • RWS, UC Davis and SDSC, California
Added:
>
>
  • DAKS, Genome Center, UC Davis, California

  • PASS, Harvard
  • SDG, Pacific Northwest National Lab
  • NcsaD2k and NcsaCi, National Center for Supercomputing Applications
 <<O>>  Difference Topic WebHome (r1.37 - 12 Sep 2006 - GaurangMehta)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 39 to 39

  • NcsaD2k and NcsaCi, National Center for Supercomputing Applications
  • UChicago, University of Chicago Computation Institute
  • Southampton, University of Southampton, PASOA and Provenance projects
Added:
>
>
  • USC/ISI, University Of Southern California/Information Sciences Institute

  • And more ...
  • ...
 <<O>>  Difference Topic WebHome (r1.36 - 12 Sep 2006 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 38 to 38

  • SDG, Pacific Northwest National Lab
  • NcsaD2k and NcsaCi, National Center for Supercomputing Applications
  • UChicago, University of Chicago Computation Institute
Added:
>
>
  • Southampton, University of Southampton, PASOA and Provenance projects

  • And more ...
  • ...
 <<O>>  Difference Topic WebHome (r1.35 - 12 Sep 2006 - MichaelWilde)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 37 to 37

  • PASS, Harvard
  • SDG, Pacific Northwest National Lab
  • NcsaD2k and NcsaCi, National Center for Supercomputing Applications
Added:
>
>
  • UChicago, University of Chicago Computation Institute

  • And more ...
  • ...
 <<O>>  Difference Topic WebHome (r1.34 - 12 Sep 2006 - JoeFutrelle)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 36 to 36

  • RWS, UC Davis and SDSC, California
  • PASS, Harvard
  • SDG, Pacific Northwest National Lab
Changed:
<
<
  • And another one
>
>
  • NcsaD2k and NcsaCi, National Center for Supercomputing Applications
  • And more ...

  • ...
 <<O>>  Difference Topic WebHome (r1.33 - 11 Sep 2006 - TaraGibson)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 35 to 35

  • UPenn, University of Pennsylvania, Database Group
  • RWS, UC Davis and SDSC, California
  • PASS, Harvard
Added:
>
>
  • SDG, Pacific Northwest National Lab

  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.32 - 08 Sep 2006 - PassProject)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 34 to 34

  • ES3, University of California, Santa Barbara
  • UPenn, University of Pennsylvania, Database Group
  • RWS, UC Davis and SDSC, California
Added:
>
>

  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.31 - 08 Sep 2006 - LucMoreau)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance standardization at the International Provenance and Annotation Workshop (IPAW'06, www.ipaw.info), the community decided that it needs to understand the different representations used for provenance, its common aspects, and the reasons for its differences. As a result, the community agreed that a "Provenance Challenge" should be set to compare and understand existing approaches.

Deleted:
<
<
A mailing list for challenge related issues has been set up. An archive is available from http://www.ipaw.info/mail/archive.php/. In order to subscribe please send

Once subscribed messages can be sent to provenance-challenge@ipaw.info .


First Provenance Challenge

Line: 22 to 16

Provenance Challenge Workshop

Changed:
<
<
A workshop to discuss the results of the first provenance challenge will be held at the next Global Grid Forum (GGF-18), to be held in Washington, September 11 to 14, 2006. The provenance challenge workshop itself will take place on the 13th and 14th September.
>
>
A workshop to discuss the results of the first provenance challenge will be held at the next Global Grid Forum (GGF-18), to be held in Washington, September 11 to 14, 2006. The provenance challenge workshop itself will take place on the 13th and 14th September. Detailed schedule.

The workshop venue is room 147B of the conference facility (Washington Convention Center). There are details about getting to the convention center at here.

Line: 49 to 43

The ProvenanceQueriesMatrix is a summary of how queries are supported by the different systems.
Added:
>
>

Mailing list

A mailing list for challenge related issues has been set up. An archive is available from http://www.ipaw.info/mail/archive.php/. In order to subscribe please send

Once subscribed messages can be sent to provenance-challenge@ipaw.info .



For information, other challenges have been previously set in other areas of computer science.

 <<O>>  Difference Topic WebHome (r1.30 - 07 Sep 2006 - NorbertPodhorszki)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 39 to 39

  • Gridprovenance, Cardiff University
  • ES3, University of California, Santa Barbara
  • UPenn, University of Pennsylvania, Database Group
Added:
>
>
  • RWS, UC Davis and SDSC, California

  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.29 - 07 Sep 2006 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 24 to 24

A workshop to discuss the results of the first provenance challenge will be held at the next Global Grid Forum (GGF-18), to be held in Washington, September 11 to 14, 2006. The provenance challenge workshop itself will take place on the 13th and 14th September.

Added:
>
>
The workshop venue is room 147B of the conference facility (Washington Convention Center). There are details about getting to the convention center at here.

Contributions by Participating Teams

Each team should make a copy of the ChallengeTemplate and produce their entry below.

 <<O>>  Difference Topic WebHome (r1.28 - 06 Sep 2006 - SarahCohenBoulakia)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 36 to 36

Added:
>
>
  • UPenn, University of Pennsylvania, Database Group

  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.27 - 06 Sep 2006 - JamesFrew)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 35 to 35

Added:
>
>
  • ES3, University of California, Santa Barbara

  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.26 - 04 Sep 2006 - VikasDeora)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 34 to 34

Changed:
<
<
  • Create here your own entry
>
>

  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.25 - 31 Aug 2006 - EmanueleSantos)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 33 to 33

Added:
>
>

  • Create here your own entry
  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.24 - 29 Aug 2006 - JunZhao)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 32 to 32

  • MINDSWAP, Semantic Web Research Group, University of Maryland, College Park
  • Karma, Computer Science Department, Indiana University
  • CESNET, GRID research group, CESNET z.s.p.o. Prague, Czech Republic
Added:
>
>
  • myGrid, University of Manchester

  • Create here your own entry
  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.23 - 22 Aug 2006 - JiriSitera)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 31 to 31

  • REDUX, Database Research Group, MSR
  • MINDSWAP, Semantic Web Research Group, University of Maryland, College Park
  • Karma, Computer Science Department, Indiana University
Added:
>
>

  • Create here your own entry
  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.22 - 21 Aug 2006 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 13 to 13

First Provenance Challenge

Changed:
<
<
The first provenance challenge commenced on 2006-June-19 and will conclude in September with a day-long workshop, where teams compare their approaches. Please see first provenance challenge for full details on the challenge.
>
>
The first provenance challenge commenced on 2006-June-19 and will conclude in September with a day-long workshop, where teams compare their approaches. Please see first provenance challenge for full details on the challenge. The deadline for uploading results is the start of the workshop, 13th September 2006.

Each participating team will upload their material on the twiki, adopting the following templage ChallengeTemplate. In order to create a TWiki page on which you can upload your results, you each have to register with the site at the URL below. It would be helpful if you could register sooner rather than later, even if you don't create a page or edit anything, so that we know who is taking part.

Line: 22 to 22

Provenance Challenge Workshop

Changed:
<
<
A workshop to discuss the results of the first provenance challenge will be held at the next Global Grid Forum (GGF-18), to be held in Washington, September 11 to 14, 2006. The exact date of the workshop within the forum will be announced shortly.
>
>
A workshop to discuss the results of the first provenance challenge will be held at the next Global Grid Forum (GGF-18), to be held in Washington, September 11 to 14, 2006. The provenance challenge workshop itself will take place on the 13th and 14th September.

Contributions by Participating Teams

 <<O>>  Difference Topic WebHome (r1.21 - 24 Jul 2006 - YogeshSimmhan)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 30 to 30

  • REDUX, Database Research Group, MSR
  • MINDSWAP, Semantic Web Research Group, University of Maryland, College Park
Added:
>
>
  • Karma, Computer Science Department, Indiana University

  • Create here your own entry
  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.20 - 21 Jul 2006 - JenniferGolbeck)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 29 to 29

Each team should make a copy of the ChallengeTemplate and produce their entry below.

  • REDUX, Database Research Group, MSR
Added:
>
>
  • MINDSWAP, Semantic Web Research Group, University of Maryland, College Park

  • Create here your own entry
  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.19 - 18 Jul 2006 - LucianoDigiampietri)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 28 to 28

Each team should make a copy of the ChallengeTemplate and produce their entry below.

Changed:
<
<
>
>
  • REDUX, Database Research Group, MSR

  • Create here your own entry
  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.18 - 18 Jul 2006 - LucianoDigiampietri)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 28 to 28

Each team should make a copy of the ChallengeTemplate and produce their entry below.

Added:
>
>

  • Create here your own entry
  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.17 - 30 Jun 2006 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

 <<O>>  Difference Topic WebHome (r1.16 - 28 Jun 2006 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 13 to 13

First Provenance Challenge

Changed:
<
<
The first provenance challenge commences 2006-Jun-15 and will conclude in September with a day-long workshop, where teams compare their approaches.
>
>
The first provenance challenge commenced on 2006-June-19 and will conclude in September with a day-long workshop, where teams compare their approaches. Please see first provenance challenge for full details on the challenge.

Each participating team will upload their material on the twiki, adopting the following templage ChallengeTemplate. In order to create a TWiki page on which you can upload your results, you each have to register with the site at the URL below. It would be helpful if you could register sooner rather than later, even if you don't create a page or edit anything, so that we know who is taking part.

http://twiki.gridprovenance.org/bin/view/TWiki/TWikiRegistration


Deleted:
<
<
Each participating team will upload their material on the twiki, adopting the following templage ChallengeTemplate.

Provenance Challenge Workshop

 <<O>>  Difference Topic WebHome (r1.15 - 27 Jun 2006 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 17 to 17

Each participating team will upload their material on the twiki, adopting the following templage ChallengeTemplate.

Added:
>
>

Provenance Challenge Workshop

A workshop to discuss the results of the first provenance challenge will be held at the next Global Grid Forum (GGF-18), to be held in Washington, September 11 to 14, 2006. The exact date of the workshop within the forum will be announced shortly.


Contributions by Participating Teams

Each team should make a copy of the ChallengeTemplate and produce their entry below.

 <<O>>  Difference Topic WebHome (r1.14 - 13 Jun 2006 - LucMoreau)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance standardization at the International Provenance and Annotation Workshop (IPAW'06, www.ipaw.info), the community decided that it needs to understand the different representations used for provenance, its common aspects, and the reasons for its differences. As a result, the community agreed that a "Provenance Challenge" should be set to compare and understand existing approaches.

Changed:
<
<
A mailing list for challenge related issues has been set up. An archive will be made availalbe soon. In order to subscribe please send
>
>
A mailing list for challenge related issues has been set up. An archive is available from http://www.ipaw.info/mail/archive.php/. In order to subscribe please send

 <<O>>  Difference Topic WebHome (r1.13 - 07 Jun 2006 - PaulGroth)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance standardization at the International Provenance and Annotation Workshop (IPAW'06, www.ipaw.info), the community decided that it needs to understand the different representations used for provenance, its common aspects, and the reasons for its differences. As a result, the community agreed that a "Provenance Challenge" should be set to compare and understand existing approaches.

Changed:
<
<
A mailing list for challenge related issues has been set up and its archive is available from XXX. In order to subscribe ....
>
>
A mailing list for challenge related issues has been set up. An archive will be made availalbe soon. In order to subscribe please send

Once subscribed messages can be sent to provenance-challenge@ipaw.info .


First Provenance Challenge

 <<O>>  Difference Topic WebHome (r1.12 - 06 Jun 2006 - LucMoreau)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance standardization at the International Provenance and Annotation Workshop (IPAW'06, www.ipaw.info), the community decided that it needs to understand the different representations used for provenance, its common aspects, and the reasons for its differences. As a result, the community agreed that a "Provenance Challenge" should be set to compare and understand existing approaches.

Added:
>
>
A mailing list for challenge related issues has been set up and its archive is available from XXX. In order to subscribe ....

First Provenance Challenge

Line: 14 to 16

Contributions by Participating Teams

Added:
>
>
Each team should make a copy of the ChallengeTemplate and produce their entry below.

Added:
>
>
  • Create here your own entry
  • And another one
  • ...

Provenance Queries Matrix

The ProvenanceQueriesMatrix is a summary of how queries are supported by the different systems.



For information, other challenges have been previously set in other areas of computer science.

Deleted:
<
<
META FILEATTACHMENT BrainAtlas?.png attr="" comment="Brain Atlas workflow" date="1147791744" path="BrainAtlas.png" size="5220" user="SimonMiles" version="1.1"
 <<O>>  Difference Topic WebHome (r1.11 - 31 May 2006 - LucMoreau)

Provenance Challenge Wiki

Changed:
<
<
Welcome to the home of TWiki.Challenge. This is a web-based collaboration area for the Provenance Challenge. This Web is public, but to edit, you need to be registered.
>
>
Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance standardization at the International Provenance and Annotation Workshop (IPAW'06, www.ipaw.info), the community decided that it needs to understand the different representations used for provenance, its common aspects, and the reasons for its differences. As a result, the community agreed that a "Provenance Challenge" should be set to compare and understand existing approaches.

First Provenance Challenge

Changed:
<
<
The first provenance challenge commences 2006-July-15. The full details can be found here.
>
>
The first provenance challenge commences 2006-Jun-15 and will conclude in September with a day-long workshop, where teams compare their approaches.

Each participating team will upload their material on the twiki, adopting the following templage ChallengeTemplate.

Contributions by Participating Teams


Deleted:
<
<

Other Challenges


Deleted:
<
<
http://sws-challenge.org/wiki/index.php/Main_Page

Deleted:
<
<
This is a Music-IR contest that I am involved in.

Changed:
<
<
http://www.music-ir.org/evaluation/ http://www.music-ir.org/mirexwiki/index.php/Main_Page
>
>


Changed:
<
<
http://fling-l.seas.upenn.edu/~plclub/cgi-bin/poplmark/
>
>
For information, other challenges have been previously set in other areas of computer science.

META FILEATTACHMENT BrainAtlas?.png attr="" comment="Brain Atlas workflow" date="1147791744" path="BrainAtlas.png" size="5220" user="SimonMiles" version="1.1"
 <<O>>  Difference Topic WebHome (r1.10 - 22 May 2006 - SimonMiles)
Changed:
<
<
Welcome to the home of TWiki.Challenge. This is a web-based collaboration area for the Provenance Challenge. This Web is public, but to edit, you need to be registered.

Provenance Challenge

The provenance challenge aims to establish a common understanding of the capabilities (and limitations) of available provenance-related technologies. In particular, once the systems have been compared, we aim to have an understanding of the following.

  • The representations that we each use to document details of processes that have occurred (e.g. workflows run)
  • The capabilities of each system in answering provenance-related queries
  • What each system considers to be within scope of the topic of provenance (regardless of whether the system can yet achieve all problems in that scope)

While we hope that all participating projects can contribute to the above aims, we explicitly do not assume that every system will be best evaluated by the same criteria, or have the same definition of provenance, ideal way to illustrate its potential or scope in terms of which queries are considered relevant.

To help achieve the aims, we define a single example process that forms the basis of the challenge. Here, we use the term process to denote a series of procedures being performed in a system, each taking some data as input and producing other data as output. We do not assume that these procedures must use some particular form of technology (EXE files, Web Services etc.) or that the process is explicitly defined in a workflow script, but individual projects will interpret the technologies through which the process is executed. While based on a real workflow, the procedures in the example process are expected to be implemented as "dummies", i.e. we provide the input, output and intermediate data and the projects use fake procedures that take the right input and produce the right output. If a project wishes to enact the real process, this is also possible.

In addition to this, we will define a set of core queries that all partipicipants should show how they address, so we can compare systems. The process and core queries will be decided on collectively by the challenge participants.

Each project participating in the challenge will have its own page on this TWiki where they can inform the rest of their efforts in meeting the challenge. During the provenance challenge, we expect the participants to upload the following to their page, to then allow comparison.

  • Representations of documentation of processes or provenance for the example process
  • Extensions to the example process to best illustrate the unique aspects of their system
  • Additional queries (beyond the core queries) that illustrate the scope of their system
  • Any categorisation of queries that the project considers to have practical value

In addition to the above project pages, the result of the challenge will take the form of the following tables by which direct comparison between systems can be made.

  • A matrix of queries vs systems, indicating for each that: (1) the query can be answered by the system, (2) the system cannot answer the query now but considers it relevant, (3) the query is not relevant to the project.

Example Process

>
>

Provenance Challenge Wiki


Changed:
<
<
We propose an example process for creating population-based "brain atlases" from the fMRI Data Center's archive of high resolution anatomical MR data, using a multi-stage I/O intensive pipeline that warps the anatomical features of the brain of multiple subjects into a standard space. The process is shown below.

BrainAtlas.png

It is comprised of procedures, shown as orange ovals, and data items flowing between them, shown as trapeziums. It can be seen as five stages, where each stage is depicted as a horizontal row of the same procedure in the figure. The procedures employ the AIR (automated image registration) suite to create an averaged brain from a collection of high resolution anatomical data. In addition to the data items shown in the figure, there are other inputs to procedures (constant string options): we will make these evident in the workflow in the future.

The inputs to a workflow are a set of new brain images (3.a to 5.a) and a single reference brain image (6.a). All input images are 3D scans of a brain of varying resolutions, so that different features are evident. For each image, there is the actual image (.i) and the header/metadata information for that image (.h). The stages of the workflow do as follows.

  1. For each new brain image, it is compared to the reference image to determine how the new image should be warped, i.e. the position and shape of the image adjusted, to match the reference brain. The output of each procedure in the stage is a warp parameter set defining the warp to be done.
  2. For each warp parameter set, the actual transformation is done by creating a new version of the original new brain image with the configuration defined in the warp parameter set. The output is a warped image.
  3. All the warped images are averaged into one single image.
  4. For each