<<O>>  Difference Topic WebHome (r1.52 - 01 Dec 2009 - LucMoreau)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 6 to 6

  1. The first provenance challenge (details here and teams' results available from here) commenced 2006-June-19 and concluded in a workshop held on 2006-September-13 in Washington, DC. There was a total of 17 teams, contributing a diverse range of results. As part of the discussion at that workshop, it was decided to hold a second challenge, for which the focus would be interoperability between systems.
  2. The second provenance challenge commenced on 2006-December-12 and concluded on 2007-June-26 with a day-long workshop at High Performance Distributed Computing in Monterey, California, where teams presented and discussed the results. Please see the specification of the challenge, the agenda (including teams' presentations) and a summary of the technical points raised in discussion. The result data from all teams are available from the team pages.

    The second provenance challenge resulted in discussions where a consensus about a common data model began to emerge. This consensus, summarised at SecondWorkshopMinutes, has led to a proposed specification of a provenance data model and inference rules, the Open Provenance Model: OPM. A review period of this model is commencing in January 2008, with hope to agree on a data model and evaluate it a Third Provenance Challenge. Comments about the feedback can be found at OpenProvenanceModelReview.

    The OpenProvenanceModelWorkshop took place on Thursday July 19th, just after IPAW, at the University of Utah, Salt Lake City. Minutes of the workshop can be found at FirstOPMWorkshopMinutes. It resulted in a minor revision of the OPM specification.

Changed:
<
<
  1. The third provenance challenge was launched on 2009-March-03 to evaluate the Open Provenance Model practically, from an inter-operability viewpoint.
>
>
  1. The third provenance challenge was launched on 2009-March-03 to evaluate the Open Provenance Model practically, from an inter-operability viewpoint. It resulted in a workshop to discuss findings, and will complete with a special issue on Using the Open Provenance Model to Address Interoperability Challenges.
  2. The fourth provenance challenge will start in early 2010 and will be defined in a scoping workshop colocated with IPAW'2010.

Mailing list

 <<O>>  Difference Topic WebHome (r1.51 - 19 Mar 2009 - LucMoreau)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance standardization at the International Provenance and Annotation Workshop (IPAW'06, www.ipaw.info), the community decided that it needs to understand the different representations used for provenance, its common aspects, and the reasons for its differences. As a result, the community agreed that a "Provenance Challenge" should be set to compare and understand existing approaches.

Changed:
<
<
The First Provenance Challenge (details here and teams' results available from here) commenced 2006-June-19 and concluded in a workshop held on 2006-September-13 in Washington, DC. There was a total of 17 teams, contributing a diverse range of results. As part of the discussion at that workshop, it was decided to hold a second challenge, for which the focus would be interoperability between systems.
>
>
  1. The first provenance challenge (details here and teams' results available from here) commenced 2006-June-19 and concluded in a workshop held on 2006-September-13 in Washington, DC. There was a total of 17 teams, contributing a diverse range of results. As part of the discussion at that workshop, it was decided to hold a second challenge, for which the focus would be interoperability between systems.
  2. The second provenance challenge commenced on 2006-December-12 and concluded on 2007-June-26 with a day-long workshop at High Performance Distributed Computing in Monterey, California, where teams presented and discussed the results. Please see the specification of the challenge, the agenda (including teams' presentations) and a summary of the technical points raised in discussion. The result data from all teams are available from the team pages.

    The second provenance challenge resulted in discussions where a consensus about a common data model began to emerge. This consensus, summarised at SecondWorkshopMinutes, has led to a proposed specification of a provenance data model and inference rules, the Open Provenance Model: OPM. A review period of this model is commencing in January 2008, with hope to agree on a data model and evaluate it a Third Provenance Challenge. Comments about the feedback can be found at OpenProvenanceModelReview.

    The OpenProvenanceModelWorkshop took place on Thursday July 19th, just after IPAW, at the University of Utah, Salt Lake City. Minutes of the workshop can be found at FirstOPMWorkshopMinutes. It resulted in a minor revision of the OPM specification.

  3. The third provenance challenge was launched on 2009-March-03 to evaluate the Open Provenance Model practically, from an inter-operability viewpoint.

Deleted:
<
<
The second provenance challenge commenced on 2006-December-12 and concluded on 2007-June-26 with a day-long workshop at High Performance Distributed Computing in Monterey, California, where teams presented and discussed the results. Please see the specification of the challenge, the agenda (including teams' presentations) and a summary of the technical points raised in discussion. The result data from all teams are available from the team pages.

The second provenance challenge resulted in discussions where a consensus about a common data model began to emerge. This consensus, summarised at SecondWorkshopMinutes, has led to a proposed specification of a provenance data model and inference rules, the Open Provenance Model: OPM. A review period of this model is commencing in January 2008, with hope to agree on a data model and evaluate it a Third Provenance Challenge. Comments about the feedback can be found at OpenProvenanceModelReview.

The OpenProvenanceModelWorkshop took place on Thursday July 19th, just after IPAW, at the University of Utah, Salt Lake City. Minutes of the workshop can be found at FirstOPMWorkshopMinutes. It resulted in a minor revision of the OPM specification.


Mailing list

 <<O>>  Difference Topic WebHome (r1.50 - 27 Nov 2008 - LucMoreau)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 15 to 15

Mailing list

Changed:
<
<
A mailing list for challenge related issues has been set up. An archive is available from http://www.ipaw.info/mail/archive.php/. In order to subscribe please send
>
>
A mailing list for challenge related issues has been set up. In order to subscribe please send

Once subscribed messages can be sent to provenance-challenge@ipaw.info .

Added:
>
>

Mailing list archive

An archive is available from



 <<O>>  Difference Topic WebHome (r1.49 - 17 Jul 2008 - LucMoreau)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 11 to 11

The second provenance challenge resulted in discussions where a consensus about a common data model began to emerge. This consensus, summarised at SecondWorkshopMinutes, has led to a proposed specification of a provenance data model and inference rules, the Open Provenance Model: OPM. A review period of this model is commencing in January 2008, with hope to agree on a data model and evaluate it a Third Provenance Challenge. Comments about the feedback can be found at OpenProvenanceModelReview.

Changed:
<
<
The OpenProvenanceModelWorkshop will take place on Thursday 19th, just after IPAW, at the University of Utah, Salt Lake City.
>
>
The OpenProvenanceModelWorkshop took place on Thursday July 19th, just after IPAW, at the University of Utah, Salt Lake City. Minutes of the workshop can be found at FirstOPMWorkshopMinutes. It resulted in a minor revision of the OPM specification.

Mailing list

 <<O>>  Difference Topic WebHome (r1.48 - 16 May 2008 - LucMoreau)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 11 to 11

The second provenance challenge resulted in discussions where a consensus about a common data model began to emerge. This consensus, summarised at SecondWorkshopMinutes, has led to a proposed specification of a provenance data model and inference rules, the Open Provenance Model: OPM. A review period of this model is commencing in January 2008, with hope to agree on a data model and evaluate it a Third Provenance Challenge. Comments about the feedback can be found at OpenProvenanceModelReview.

Added:
>
>
The OpenProvenanceModelWorkshop will take place on Thursday 19th, just after IPAW, at the University of Utah, Salt Lake City.

Mailing list

A mailing list for challenge related issues has been set up. An archive is available from http://www.ipaw.info/mail/archive.php/. In order to subscribe please send

 <<O>>  Difference Topic WebHome (r1.47 - 18 Dec 2007 - LucMoreau)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance standardization at the International Provenance and Annotation Workshop (IPAW'06, www.ipaw.info), the community decided that it needs to understand the different representations used for provenance, its common aspects, and the reasons for its differences. As a result, the community agreed that a "Provenance Challenge" should be set to compare and understand existing approaches.

Changed:
<
<
The first challenge (details here and teams' results available from here) commenced 2006-June-19 and concluded in a workshop held on 2006-September-13 in Washington, DC. There was a total of 17 teams, contributing a diverse range of results. As part of the discussion at that workshop, it was decided to hold a second challenge, for which the focus would be interoperability between systems.
>
>
The First Provenance Challenge (details here and teams' results available from here) commenced 2006-June-19 and concluded in a workshop held on 2006-September-13 in Washington, DC. There was a total of 17 teams, contributing a diverse range of results. As part of the discussion at that workshop, it was decided to hold a second challenge, for which the focus would be interoperability between systems.

Changed:
<
<
The second provenance challenge commenced on 2006-December-12 and concluded on 2007-June-26 with a day-long workshop at High Performance Distributed Computing in Monterey, California, where teams presented and discussed the results. Please see the specification of the challenge, the agenda (including teams' presentations) and a summary of the technical points raised in discussion. The result data from all teams are available from the team pages.
>
>
The second provenance challenge commenced on 2006-December-12 and concluded on 2007-June-26 with a day-long workshop at High Performance Distributed Computing in Monterey, California, where teams presented and discussed the results. Please see the specification of the challenge, the agenda (including teams' presentations) and a summary of the technical points raised in discussion. The result data from all teams are available from the team pages.

The second provenance challenge resulted in discussions where a consensus about a common data model began to emerge. This consensus, summarised at SecondWorkshopMinutes, has led to a proposed specification of a provenance data model and inference rules, the Open Provenance Model: OPM. A review period of this model is commencing in January 2008, with hope to agree on a data model and evaluate it a Third Provenance Challenge. Comments about the feedback can be found at OpenProvenanceModelReview.


Mailing list

 <<O>>  Difference Topic WebHome (r1.46 - 30 Jul 2007 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 6 to 6

The first challenge (details here and teams' results available from here) commenced 2006-June-19 and concluded in a workshop held on 2006-September-13 in Washington, DC. There was a total of 17 teams, contributing a diverse range of results. As part of the discussion at that workshop, it was decided to hold a second challenge, for which the focus would be interoperability between systems.

Changed:
<
<

Second Provenance Challenge

The second provenance challenge commenced on 2006-December-12 and will conclude in June with a day-long workshop, where teams will assess the results. Please see this page for full details on the challenge. The deadline for uploading the first phase results is 2007-February-20 and the final results by the start of the workshop on 2007-June-26. If you are taking part, please subscribe to the mailing list (details below), as this is where we will announce details of the workshop and any clarifications to the challenge itself.

There is no need to have taken part in the first challenge to take part in the second: anyone who wants to is welcome to participate!

Contributions by Participating Teams

Each participating team will upload their material on the twiki, adopting the following template SecondChallengeTemplate. Each team should make a copy of the template and add their entry to the ParticipatingTeams page.

If you have not already done so for the first challenge, in order to create a TWiki page on which you can upload your results, you each have to register with the site at the URL below. It would be helpful if you could register sooner rather than later, even if you don't create a page or edit anything, so that we know who is taking part.

http://twiki.gridprovenance.org/bin/view/TWiki/TWikiRegistration

Provenance Challenge Workshop

A meeting to discuss the results of the second provenance challenge will be held as a workshop at the next High Performance Distributed Computing conference on the 25th and/or 26th June.

>
>
The second provenance challenge commenced on 2006-December-12 and concluded on 2007-June-26 with a day-long workshop at High Performance Distributed Computing in Monterey, California, where teams presented and discussed the results. Please see the specification of the challenge, the agenda (including teams' presentations) and a summary of the technical points raised in discussion. The result data from all teams are available from the team pages.

Mailing list

 <<O>>  Difference Topic WebHome (r1.45 - 29 Apr 2007 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 8 to 8

Second Provenance Challenge

Changed:
<
<
The second provenance challenge commenced on 2006-December-12 and will conclude in June with a day-long workshop, where teams will assess the results. Please see this page for full details on the challenge. The deadline for uploading the first phase results is 2007-February-20 and the final results by the start of the workshop on 2007-June-25 (or possibly 2007-June-26, to be confirmed). If you are taking part, please subscribe to the mailing list (details below), as this is where we will announce details of the workshop and any clarifications to the challenge itself.
>
>
The second provenance challenge commenced on 2006-December-12 and will conclude in June with a day-long workshop, where teams will assess the results. Please see this page for full details on the challenge. The deadline for uploading the first phase results is 2007-February-20 and the final results by the start of the workshop on 2007-June-26. If you are taking part, please subscribe to the mailing list (details below), as this is where we will announce details of the workshop and any clarifications to the challenge itself.

There is no need to have taken part in the first challenge to take part in the second: anyone who wants to is welcome to participate!

 <<O>>  Difference Topic WebHome (r1.44 - 03 Jan 2007 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 8 to 8

Second Provenance Challenge

Changed:
<
<
The second provenance challenge commenced on 2006-December-12 and will conclude in June with a day-long workshop, where teams will assess the results. Please see this page for full details on the challenge. The deadline for uploading the first phase results is 2007-January-31 and the final results by the start of the workshop on 2007-June-25 (or possibly 2007-June-26, to be confirmed). If you are taking part, please subscribe to the mailing list (details below), as this is where we will announce details of the workshop and any clarifications to the challenge itself.
>
>
The second provenance challenge commenced on 2006-December-12 and will conclude in June with a day-long workshop, where teams will assess the results. Please see this page for full details on the challenge. The deadline for uploading the first phase results is 2007-February-20 and the final results by the start of the workshop on 2007-June-25 (or possibly 2007-June-26, to be confirmed). If you are taking part, please subscribe to the mailing list (details below), as this is where we will announce details of the workshop and any clarifications to the challenge itself.

There is no need to have taken part in the first challenge to take part in the second: anyone who wants to is welcome to participate!

 <<O>>  Difference Topic WebHome (r1.43 - 11 Dec 2006 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance standardization at the International Provenance and Annotation Workshop (IPAW'06, www.ipaw.info), the community decided that it needs to understand the different representations used for provenance, its common aspects, and the reasons for its differences. As a result, the community agreed that a "Provenance Challenge" should be set to compare and understand existing approaches.

Changed:
<
<

First Provenance Challenge

>
>
The first challenge (details here and teams' results available from here) commenced 2006-June-19 and concluded in a workshop held on 2006-September-13 in Washington, DC. There was a total of 17 teams, contributing a diverse range of results. As part of the discussion at that workshop, it was decided to hold a second challenge, for which the focus would be interoperability between systems.

Changed:
<
<
The first provenance challenge commenced on 2006-June-19 and will conclude in September with a day-long workshop, where teams compare their approaches. Please see first provenance challenge for full details on the challenge. The deadline for uploading results is the start of the workshop, 13th September 2006.

Each participating team will upload their material on the twiki, adopting the following templage ChallengeTemplate. In order to create a TWiki page on which you can upload your results, you each have to register with the site at the URL below. It would be helpful if you could register sooner rather than later, even if you don't create a page or edit anything, so that we know who is taking part.

http://twiki.gridprovenance.org/bin/view/TWiki/TWikiRegistration

Provenance Challenge Workshop

>
>

Second Provenance Challenge


Changed:
<
<
A workshop to discuss the results of the first provenance challenge will be held at the next Global Grid Forum (GGF-18), to be held in Washington, September 11 to 14, 2006. The provenance challenge workshop itself will take place on the 13th and 14th September. Detailed schedule.
>
>
The second provenance challenge commenced on 2006-December-12 and will conclude in June with a day-long workshop, where teams will assess the results. Please see this page for full details on the challenge. The deadline for uploading the first phase results is 2007-January-31 and the final results by the start of the workshop on 2007-June-25 (or possibly 2007-June-26, to be confirmed). If you are taking part, please subscribe to the mailing list (details below), as this is where we will announce details of the workshop and any clarifications to the challenge itself.

Changed:
<
<
The workshop venue is room 147B of the conference facility (Washington Convention Center). There are details about getting to the convention center at here.
>
>
There is no need to have taken part in the first challenge to take part in the second: anyone who wants to is welcome to participate!

Contributions by Participating Teams

Changed:
<
<
Each team should make a copy of the ChallengeTemplate and produce their entry on the ParticipatingTeams page.
>
>
Each participating team will upload their material on the twiki, adopting the following template SecondChallengeTemplate. Each team should make a copy of the template and add their entry to the ParticipatingTeams page.

Changed:
<
<
<!--
  • REDUX, Database Research Group, MSR
  • MINDSWAP, Semantic Web Research Group, University of Maryland, College Park
  • Karma, Computer Science Department, Indiana University
  • CESNET, GRID research group, CESNET z.s.p.o. Prague, Czech Republic
  • myGrid, University of Manchester
  • VisTrails, University of Utah
  • Gridprovenance, Cardiff University
  • ES3, University of California, Santa Barbara
  • UPenn, University of Pennsylvania, Database Group
  • RWS, UC Davis and SDSC, California
  • DAKS, Genome Center, UC Davis, California
  • PASS, Harvard
  • SDG, Pacific Northwest National Lab
  • NcsaD2k and NcsaCi, National Center for Supercomputing Applications
  • UChicago, University of Chicago Computation Institute
  • Southampton, University of Southampton, PASOA and Provenance projects
  • USC/ISI, University Of Southern California/Information Sciences Institute
  • And more ...
  • ...
>
>
If you have not already done so for the first challenge, in order to create a TWiki page on which you can upload your results, you each have to register with the site at the URL below. It would be helpful if you could register sooner rather than later, even if you don't create a page or edit anything, so that we know who is taking part.

Changed:
<
<
-->

Provenance Queries Matrix

>
>
http://twiki.gridprovenance.org/bin/view/TWiki/TWikiRegistration

Changed:
<
<
The ProvenanceQueriesMatrix is a summary of how queries are supported by the different systems.
>
>

Provenance Challenge Workshop

A meeting to discuss the results of the second provenance challenge will be held as a workshop at the next High Performance Distributed Computing conference on the 25th and/or 26th June.


Added:
>
>

Mailing list


Deleted:
<
<

Mailing list


A mailing list for challenge related issues has been set up. An archive is available from http://www.ipaw.info/mail/archive.php/. In order to subscribe please send

Line: 59 to 34

Once subscribed messages can be sent to provenance-challenge@ipaw.info .
Deleted:
<
<

After the First Provenance Challenge

Participants please add to both of these!

Second Provenance Challenge

  • Details on collaborations between teams to show interoperability?


 <<O>>  Difference Topic WebHome (r1.42 - 06 Oct 2006 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 65 to 65

Deleted:
<
<

Second Provenance Challenge

 <<O>>  Difference Topic WebHome (r1.41 - 06 Oct 2006 - SteveMunroe)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 65 to 65

Added:
>
>

Second Provenance Challenge

 <<O>>  Difference Topic WebHome (r1.40 - 14 Sep 2006 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 60 to 59

Once subscribed messages can be sent to provenance-challenge@ipaw.info .
Added:
>
>

After the First Provenance Challenge

Participants please add to both of these!

Second Provenance Challenge

  • Details on collaborations between teams to show interoperability?


For information, other challenges have been previously set in other areas of computer science.

 <<O>>  Difference Topic WebHome (r1.39 - 13 Sep 2006 - LucMoreau)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 22 to 22

Contributions by Participating Teams

Changed:
<
<
Each team should make a copy of the ChallengeTemplate and produce their entry below.
>
>
Each team should make a copy of the ChallengeTemplate and produce their entry on the ParticipatingTeams page.

Added:
>
>
<!--

  • REDUX, Database Research Group, MSR
  • MINDSWAP, Semantic Web Research Group, University of Maryland, College Park
  • Karma, Computer Science Department, Indiana University
Line: 44 to 45

  • And more ...
  • ...
Changed:
<
<
>
>
-->

Provenance Queries Matrix

The ProvenanceQueriesMatrix is a summary of how queries are supported by the different systems.

 <<O>>  Difference Topic WebHome (r1.38 - 12 Sep 2006 - ShawnBowers)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 34 to 34

  • ES3, University of California, Santa Barbara
  • UPenn, University of Pennsylvania, Database Group
  • RWS, UC Davis and SDSC, California
Added:
>
>
  • DAKS, Genome Center, UC Davis, California

  • PASS, Harvard
  • SDG, Pacific Northwest National Lab
  • NcsaD2k and NcsaCi, National Center for Supercomputing Applications
 <<O>>  Difference Topic WebHome (r1.37 - 12 Sep 2006 - GaurangMehta)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 39 to 39

  • NcsaD2k and NcsaCi, National Center for Supercomputing Applications
  • UChicago, University of Chicago Computation Institute
  • Southampton, University of Southampton, PASOA and Provenance projects
Added:
>
>
  • USC/ISI, University Of Southern California/Information Sciences Institute

  • And more ...
  • ...
 <<O>>  Difference Topic WebHome (r1.36 - 12 Sep 2006 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 38 to 38

  • SDG, Pacific Northwest National Lab
  • NcsaD2k and NcsaCi, National Center for Supercomputing Applications
  • UChicago, University of Chicago Computation Institute
Added:
>
>
  • Southampton, University of Southampton, PASOA and Provenance projects

  • And more ...
  • ...
 <<O>>  Difference Topic WebHome (r1.35 - 12 Sep 2006 - MichaelWilde)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 37 to 37

  • PASS, Harvard
  • SDG, Pacific Northwest National Lab
  • NcsaD2k and NcsaCi, National Center for Supercomputing Applications
Added:
>
>
  • UChicago, University of Chicago Computation Institute

  • And more ...
  • ...
 <<O>>  Difference Topic WebHome (r1.34 - 12 Sep 2006 - JoeFutrelle)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 36 to 36

  • RWS, UC Davis and SDSC, California
  • PASS, Harvard
  • SDG, Pacific Northwest National Lab
Changed:
<
<
  • And another one
>
>
  • NcsaD2k and NcsaCi, National Center for Supercomputing Applications
  • And more ...

  • ...
 <<O>>  Difference Topic WebHome (r1.33 - 11 Sep 2006 - TaraGibson)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 35 to 35

  • UPenn, University of Pennsylvania, Database Group
  • RWS, UC Davis and SDSC, California
  • PASS, Harvard
Added:
>
>
  • SDG, Pacific Northwest National Lab

  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.32 - 08 Sep 2006 - PassProject)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 34 to 34

  • ES3, University of California, Santa Barbara
  • UPenn, University of Pennsylvania, Database Group
  • RWS, UC Davis and SDSC, California
Added:
>
>

  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.31 - 08 Sep 2006 - LucMoreau)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance standardization at the International Provenance and Annotation Workshop (IPAW'06, www.ipaw.info), the community decided that it needs to understand the different representations used for provenance, its common aspects, and the reasons for its differences. As a result, the community agreed that a "Provenance Challenge" should be set to compare and understand existing approaches.

Deleted:
<
<
A mailing list for challenge related issues has been set up. An archive is available from http://www.ipaw.info/mail/archive.php/. In order to subscribe please send

Once subscribed messages can be sent to provenance-challenge@ipaw.info .


First Provenance Challenge

Line: 22 to 16

Provenance Challenge Workshop

Changed:
<
<
A workshop to discuss the results of the first provenance challenge will be held at the next Global Grid Forum (GGF-18), to be held in Washington, September 11 to 14, 2006. The provenance challenge workshop itself will take place on the 13th and 14th September.
>
>
A workshop to discuss the results of the first provenance challenge will be held at the next Global Grid Forum (GGF-18), to be held in Washington, September 11 to 14, 2006. The provenance challenge workshop itself will take place on the 13th and 14th September. Detailed schedule.

The workshop venue is room 147B of the conference facility (Washington Convention Center). There are details about getting to the convention center at here.

Line: 49 to 43

The ProvenanceQueriesMatrix is a summary of how queries are supported by the different systems.
Added:
>
>

Mailing list

A mailing list for challenge related issues has been set up. An archive is available from http://www.ipaw.info/mail/archive.php/. In order to subscribe please send

Once subscribed messages can be sent to provenance-challenge@ipaw.info .



For information, other challenges have been previously set in other areas of computer science.

 <<O>>  Difference Topic WebHome (r1.30 - 07 Sep 2006 - NorbertPodhorszki)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 39 to 39

  • Gridprovenance, Cardiff University
  • ES3, University of California, Santa Barbara
  • UPenn, University of Pennsylvania, Database Group
Added:
>
>
  • RWS, UC Davis and SDSC, California

  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.29 - 07 Sep 2006 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 24 to 24

A workshop to discuss the results of the first provenance challenge will be held at the next Global Grid Forum (GGF-18), to be held in Washington, September 11 to 14, 2006. The provenance challenge workshop itself will take place on the 13th and 14th September.

Added:
>
>
The workshop venue is room 147B of the conference facility (Washington Convention Center). There are details about getting to the convention center at here.

Contributions by Participating Teams

Each team should make a copy of the ChallengeTemplate and produce their entry below.

 <<O>>  Difference Topic WebHome (r1.28 - 06 Sep 2006 - SarahCohenBoulakia)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 36 to 36

Added:
>
>
  • UPenn, University of Pennsylvania, Database Group

  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.27 - 06 Sep 2006 - JamesFrew)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 35 to 35

Added:
>
>
  • ES3, University of California, Santa Barbara

  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.26 - 04 Sep 2006 - VikasDeora)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 34 to 34

Changed:
<
<
  • Create here your own entry
>
>

  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.25 - 31 Aug 2006 - EmanueleSantos)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 33 to 33

Added:
>
>

  • Create here your own entry
  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.24 - 29 Aug 2006 - JunZhao)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 32 to 32

  • MINDSWAP, Semantic Web Research Group, University of Maryland, College Park
  • Karma, Computer Science Department, Indiana University
  • CESNET, GRID research group, CESNET z.s.p.o. Prague, Czech Republic
Added:
>
>
  • myGrid, University of Manchester

  • Create here your own entry
  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.23 - 22 Aug 2006 - JiriSitera)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 31 to 31

  • REDUX, Database Research Group, MSR
  • MINDSWAP, Semantic Web Research Group, University of Maryland, College Park
  • Karma, Computer Science Department, Indiana University
Added:
>
>

  • Create here your own entry
  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.22 - 21 Aug 2006 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 13 to 13

First Provenance Challenge

Changed:
<
<
The first provenance challenge commenced on 2006-June-19 and will conclude in September with a day-long workshop, where teams compare their approaches. Please see first provenance challenge for full details on the challenge.
>
>
The first provenance challenge commenced on 2006-June-19 and will conclude in September with a day-long workshop, where teams compare their approaches. Please see first provenance challenge for full details on the challenge. The deadline for uploading results is the start of the workshop, 13th September 2006.

Each participating team will upload their material on the twiki, adopting the following templage ChallengeTemplate. In order to create a TWiki page on which you can upload your results, you each have to register with the site at the URL below. It would be helpful if you could register sooner rather than later, even if you don't create a page or edit anything, so that we know who is taking part.

Line: 22 to 22

Provenance Challenge Workshop

Changed:
<
<
A workshop to discuss the results of the first provenance challenge will be held at the next Global Grid Forum (GGF-18), to be held in Washington, September 11 to 14, 2006. The exact date of the workshop within the forum will be announced shortly.
>
>
A workshop to discuss the results of the first provenance challenge will be held at the next Global Grid Forum (GGF-18), to be held in Washington, September 11 to 14, 2006. The provenance challenge workshop itself will take place on the 13th and 14th September.

Contributions by Participating Teams

 <<O>>  Difference Topic WebHome (r1.21 - 24 Jul 2006 - YogeshSimmhan)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 30 to 30

  • REDUX, Database Research Group, MSR
  • MINDSWAP, Semantic Web Research Group, University of Maryland, College Park
Added:
>
>
  • Karma, Computer Science Department, Indiana University

  • Create here your own entry
  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.20 - 21 Jul 2006 - JenniferGolbeck)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 29 to 29

Each team should make a copy of the ChallengeTemplate and produce their entry below.

  • REDUX, Database Research Group, MSR
Added:
>
>
  • MINDSWAP, Semantic Web Research Group, University of Maryland, College Park

  • Create here your own entry
  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.19 - 18 Jul 2006 - LucianoDigiampietri)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 28 to 28

Each team should make a copy of the ChallengeTemplate and produce their entry below.

Changed:
<
<
>
>
  • REDUX, Database Research Group, MSR

  • Create here your own entry
  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.18 - 18 Jul 2006 - LucianoDigiampietri)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 28 to 28

Each team should make a copy of the ChallengeTemplate and produce their entry below.

Added:
>
>

  • Create here your own entry
  • And another one
  • ...
 <<O>>  Difference Topic WebHome (r1.17 - 30 Jun 2006 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

 <<O>>  Difference Topic WebHome (r1.16 - 28 Jun 2006 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 13 to 13

First Provenance Challenge

Changed:
<
<
The first provenance challenge commences 2006-Jun-15 and will conclude in September with a day-long workshop, where teams compare their approaches.
>
>
The first provenance challenge commenced on 2006-June-19 and will conclude in September with a day-long workshop, where teams compare their approaches. Please see first provenance challenge for full details on the challenge.

Each participating team will upload their material on the twiki, adopting the following templage ChallengeTemplate. In order to create a TWiki page on which you can upload your results, you each have to register with the site at the URL below. It would be helpful if you could register sooner rather than later, even if you don't create a page or edit anything, so that we know who is taking part.

http://twiki.gridprovenance.org/bin/view/TWiki/TWikiRegistration


Deleted:
<
<
Each participating team will upload their material on the twiki, adopting the following templage ChallengeTemplate.

Provenance Challenge Workshop

 <<O>>  Difference Topic WebHome (r1.15 - 27 Jun 2006 - SimonMiles)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance

Line: 17 to 17

Each participating team will upload their material on the twiki, adopting the following templage ChallengeTemplate.

Added:
>
>

Provenance Challenge Workshop

A workshop to discuss the results of the first provenance challenge will be held at the next Global Grid Forum (GGF-18), to be held in Washington, September 11 to 14, 2006. The exact date of the workshop within the forum will be announced shortly.


Contributions by Participating Teams

Each team should make a copy of the ChallengeTemplate and produce their entry below.

 <<O>>  Difference Topic WebHome (r1.14 - 13 Jun 2006 - LucMoreau)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance standardization at the International Provenance and Annotation Workshop (IPAW'06, www.ipaw.info), the community decided that it needs to understand the different representations used for provenance, its common aspects, and the reasons for its differences. As a result, the community agreed that a "Provenance Challenge" should be set to compare and understand existing approaches.

Changed:
<
<
A mailing list for challenge related issues has been set up. An archive will be made availalbe soon. In order to subscribe please send
>
>
A mailing list for challenge related issues has been set up. An archive is available from http://www.ipaw.info/mail/archive.php/. In order to subscribe please send

 <<O>>  Difference Topic WebHome (r1.13 - 07 Jun 2006 - PaulGroth)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance standardization at the International Provenance and Annotation Workshop (IPAW'06, www.ipaw.info), the community decided that it needs to understand the different representations used for provenance, its common aspects, and the reasons for its differences. As a result, the community agreed that a "Provenance Challenge" should be set to compare and understand existing approaches.

Changed:
<
<
A mailing list for challenge related issues has been set up and its archive is available from XXX. In order to subscribe ....
>
>
A mailing list for challenge related issues has been set up. An archive will be made availalbe soon. In order to subscribe please send

Once subscribed messages can be sent to provenance-challenge@ipaw.info .


First Provenance Challenge

 <<O>>  Difference Topic WebHome (r1.12 - 06 Jun 2006 - LucMoreau)

Provenance Challenge Wiki

Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance standardization at the International Provenance and Annotation Workshop (IPAW'06, www.ipaw.info), the community decided that it needs to understand the different representations used for provenance, its common aspects, and the reasons for its differences. As a result, the community agreed that a "Provenance Challenge" should be set to compare and understand existing approaches.

Added:
>
>
A mailing list for challenge related issues has been set up and its archive is available from XXX. In order to subscribe ....

First Provenance Challenge

Line: 14 to 16

Contributions by Participating Teams

Added:
>
>
Each team should make a copy of the ChallengeTemplate and produce their entry below.

Added:
>
>
  • Create here your own entry
  • And another one
  • ...

Provenance Queries Matrix

The ProvenanceQueriesMatrix is a summary of how queries are supported by the different systems.



For information, other challenges have been previously set in other areas of computer science.

Deleted:
<
<
META FILEATTACHMENT BrainAtlas?.png attr="" comment="Brain Atlas workflow" date="1147791744" path="BrainAtlas.png" size="5220" user="SimonMiles" version="1.1"
 <<O>>  Difference Topic WebHome (r1.11 - 31 May 2006 - LucMoreau)

Provenance Challenge Wiki

Changed:
<
<
Welcome to the home of TWiki.Challenge. This is a web-based collaboration area for the Provenance Challenge. This Web is public, but to edit, you need to be registered.
>
>
Provenance is a critical concept in scientific workflows, since it allows scientists to understand the origin of their results, to repeat their experiments, and to validate the processes that were used to derive data products. During a discussion on provenance standardization at the International Provenance and Annotation Workshop (IPAW'06, www.ipaw.info), the community decided that it needs to understand the different representations used for provenance, its common aspects, and the reasons for its differences. As a result, the community agreed that a "Provenance Challenge" should be set to compare and understand existing approaches.

First Provenance Challenge

Changed:
<
<
The first provenance challenge commences 2006-July-15. The full details can be found here.
>
>
The first provenance challenge commences 2006-Jun-15 and will conclude in September with a day-long workshop, where teams compare their approaches.

Each participating team will upload their material on the twiki, adopting the following templage ChallengeTemplate.

Contributions by Participating Teams


Deleted:
<
<

Other Challenges


Deleted:
<
<
http://sws-challenge.org/wiki/index.php/Main_Page

Deleted:
<
<
This is a Music-IR contest that I am involved in.

Changed:
<
<
http://www.music-ir.org/evaluation/ http://www.music-ir.org/mirexwiki/index.php/Main_Page
>
>


Changed:
<
<
http://fling-l.seas.upenn.edu/~plclub/cgi-bin/poplmark/
>
>
For information, other challenges have been previously set in other areas of computer science.

META FILEATTACHMENT BrainAtlas?.png attr="" comment="Brain Atlas workflow" date="1147791744" path="BrainAtlas.png" size="5220" user="SimonMiles" version="1.1"
 <<O>>  Difference Topic WebHome (r1.10 - 22 May 2006 - SimonMiles)
Changed:
<
<
Welcome to the home of TWiki.Challenge. This is a web-based collaboration area for the Provenance Challenge. This Web is public, but to edit, you need to be registered.

Provenance Challenge

The provenance challenge aims to establish a common understanding of the capabilities (and limitations) of available provenance-related technologies. In particular, once the systems have been compared, we aim to have an understanding of the following.

  • The representations that we each use to document details of processes that have occurred (e.g. workflows run)
  • The capabilities of each system in answering provenance-related queries
  • What each system considers to be within scope of the topic of provenance (regardless of whether the system can yet achieve all problems in that scope)

While we hope that all participating projects can contribute to the above aims, we explicitly do not assume that every system will be best evaluated by the same criteria, or have the same definition of provenance, ideal way to illustrate its potential or scope in terms of which queries are considered relevant.

To help achieve the aims, we define a single example process that forms the basis of the challenge. Here, we use the term process to denote a series of procedures being performed in a system, each taking some data as input and producing other data as output. We do not assume that these procedures must use some particular form of technology (EXE files, Web Services etc.) or that the process is explicitly defined in a workflow script, but individual projects will interpret the technologies through which the process is executed. While based on a real workflow, the procedures in the example process are expected to be implemented as "dummies", i.e. we provide the input, output and intermediate data and the projects use fake procedures that take the right input and produce the right output. If a project wishes to enact the real process, this is also possible.

In addition to this, we will define a set of core queries that all partipicipants should show how they address, so we can compare systems. The process and core queries will be decided on collectively by the challenge participants.

Each project participating in the challenge will have its own page on this TWiki where they can inform the rest of their efforts in meeting the challenge. During the provenance challenge, we expect the participants to upload the following to their page, to then allow comparison.

  • Representations of documentation of processes or provenance for the example process
  • Extensions to the example process to best illustrate the unique aspects of their system
  • Additional queries (beyond the core queries) that illustrate the scope of their system
  • Any categorisation of queries that the project considers to have practical value

In addition to the above project pages, the result of the challenge will take the form of the following tables by which direct comparison between systems can be made.

  • A matrix of queries vs systems, indicating for each that: (1) the query can be answered by the system, (2) the system cannot answer the query now but considers it relevant, (3) the query is not relevant to the project.

Example Process

>
>

Provenance Challenge Wiki


Changed:
<
<
We propose an example process for creating population-based "brain atlases" from the fMRI Data Center's archive of high resolution anatomical MR data, using a multi-stage I/O intensive pipeline that warps the anatomical features of the brain of multiple subjects into a standard space. The process is shown below.

BrainAtlas.png

It is comprised of procedures, shown as orange ovals, and data items flowing between them, shown as trapeziums. It can be seen as five stages, where each stage is depicted as a horizontal row of the same procedure in the figure. The procedures employ the AIR (automated image registration) suite to create an averaged brain from a collection of high resolution anatomical data. In addition to the data items shown in the figure, there are other inputs to procedures (constant string options): we will make these evident in the workflow in the future.

The inputs to a workflow are a set of new brain images (3.a to 5.a) and a single reference brain image (6.a). All input images are 3D scans of a brain of varying resolutions, so that different features are evident. For each image, there is the actual image (.i) and the header/metadata information for that image (.h). The stages of the workflow do as follows.

  1. For each new brain image, it is compared to the reference image to determine how the new image should be warped, i.e. the position and shape of the image adjusted, to match the reference brain. The output of each procedure in the stage is a warp parameter set defining the warp to be done.
  2. For each warp parameter set, the actual transformation is done by creating a new version of the original new brain image with the configuration defined in the warp parameter set. The output is a warped image.
  3. All the warped images are averaged into one single image.
  4. For each dimension (x, y and z), the averaged image is sliced to give a 2D atlas along a plane in that dimension, taken through the centre of the 3D image. The output is an atlas data set.
  5. For each atlas data set, it is converted into a graphical atlas image.

align in1 in2 in 3
  out1 out2
  parameters

Core Provenance Queries

An initial set of provenance-related queries is given below. These are intended only as a starting point for discussion to find the core provenance questions for the challenge.

  • Find the process that led to a graphical atlas image / everything that caused the graphical atlas image to be as it is. This should tell us the new brain images from which the averaged atlas was generated, the warping performed etc.
  • Find all calls to procedure alignwarp, and their runtimes, with argument model=rigid that ran in less than 30 minutes on non-ia64 processors. The argument is given as a string option to the alignwarp procedure.
  • Find the average runtime of all alignwarp calls with argument model=rigid that ran in less than 30 minutes.
  • Find all procedure calls within runs of the workflow named ‘/prod/2005/0305/prep’, a version of the brain atlas workflow given above, whose inputs were linearly aligned with model=affine
  • Find all output averaged images of softmean (average) procedures, where the warped images taken as input were align-warped with model=affine (i.e., “where softmean was preceded in the workflow, directly or indirectly, by an alignwarp procedure with argument model=affine”)
  • Find all outputs of softmean that were resliced with intensify=3
  • Find the outputs that were linearly aligned with model=affine and with input LFN metadata center=UChicago
  • Find all the graphical atlas sets that have metadata annotation studyModality with values speech, visual or audio, and return all the annotation tags of this set of data
  • Find all the metadata tags study on actual results of softmean that were linearly aligned with model= affine, and whose outputs have annotation state = IL
>
>
Welcome to the home of TWiki.Challenge. This is a web-based collaboration area for the Provenance Challenge. This Web is public, but to edit, you need to be registered.

Changed:
<
<

Timetable

>
>

First Provenance Challenge


Changed:
<
<
  • 2006-May-30: First full draft of challenge, asking potential participants to help finalise
    • Fix provenance challenge proposal TWiki page, with aims, objectives, workflow, sample queries, timetable
    • Upload input, output, intermediate data used in workflow
    • Send provenance challenge call to other possible participants, including:
      • Aims, objectives, workflow, sample queries, timetable
      • Request for comments on the above
      • Request to suggest what should be core queries
  • 2006-June-15: Challenge finalised, participants start!
  • 2006-September: Deadline for challenge results to be uploaded
  • 2006-September: Face-to-face meeting at which results are discussed
  • 2006-October-15: Comparisons performed, minutes of discussion, proposed next steps uploaded
>
>
The first provenance challenge commences 2006-July-15. The full details can be found here.

Other Challenges

 <<O>>  Difference Topic WebHome (r1.9 - 18 May 2006 - SimonMiles)

Welcome to the home of TWiki.Challenge. This is a web-based collaboration area for the Provenance Challenge. This Web is public, but to edit, you need to be registered.
 <<O>>  Difference Topic WebHome (r1.8 - 17 May 2006 - LucMoreau)

Welcome to the home of TWiki.Challenge. This is a web-based collaboration area for the Provenance Challenge. This Web is public, but to edit, you need to be registered.
Line: 41 to 41

  1. For each dimension (x, y and z), the averaged image is sliced to give a 2D atlas along a plane in that dimension, taken through the centre of the 3D image. The output is an atlas data set.
  2. For each atlas data set, it is converted into a graphical atlas image.
Added:
>
>

align in1 in2 in 3
  out1 out2
  parameters

Core Provenance Queries

An initial set of provenance-related queries is given below. These are intended only as a starting point for discussion to find the core provenance questions for the challenge.

 <<O>>  Difference Topic WebHome (r1.7 - 16 May 2006 - SimonMiles)
Added:
>
>
Welcome to the home of TWiki.Challenge. This is a web-based collaboration area for the Provenance Challenge. This Web is public, but to edit, you need to be registered.

Changed:
<
<
Welcome to the home of TWiki.Challenge. This is a web-based collaboration area for the Provenance Challenge. This Web is public, but to edit, you need to be registered.
>
>

Provenance Challenge


Added:
>
>
The provenance challenge aims to establish a common understanding of the capabilities (and limitations) of available provenance-related technologies. In particular, once the systems have been compared, we aim to have an understanding of the following.
  • The representations that we each use to document details of processes that have occurred (e.g. workflows run)
  • The capabilities of each system in answering provenance-related queries
  • What each system considers to be within scope of the topic of provenance (regardless of whether the system can yet achieve all problems in that scope)

While we hope that all participating projects can contribute to the above aims, we explicitly do not assume that every system will be best evaluated by the same criteria, or have the same definition of provenance, ideal way to illustrate its potential or scope in terms of which queries are considered relevant.

To help achieve the aims, we define a single example process that forms the basis of the challenge. Here, we use the term process to denote a series of procedures being performed in a system, each taking some data as input and producing other data as output. We do not assume that these procedures must use some particular form of technology (EXE files, Web Services etc.) or that the process is explicitly defined in a workflow script, but individual projects will interpret the technologies through which the process is executed. While based on a real workflow, the procedures in the example process are expected to be implemented as "dummies", i.e. we provide the input, output and intermediate data and the projects use fake procedures that take the right input and produce the right output. If a project wishes to enact the real process, this is also possible.

In addition to this, we will define a set of core queries that all partipicipants should show how they address, so we can compare systems. The process and core queries will be decided on collectively by the challenge participants.

Each project participating in the challenge will have its own page on this TWiki where they can inform the rest of their efforts in meeting the challenge. During the provenance challenge, we expect the participants to upload the following to their page, to then allow comparison.

  • Representations of documentation of processes or provenance for the example process
  • Extensions to the example process to best illustrate the unique aspects of their system
  • Additional queries (beyond the core queries) that illustrate the scope of their system
  • Any categorisation of queries that the project considers to have practical value

In addition to the above project pages, the result of the challenge will take the form of the following tables by which direct comparison between systems can be made.

  • A matrix of queries vs systems, indicating for each that: (1) the query can be answered by the system, (2) the system cannot answer the query now but considers it relevant, (3) the query is not relevant to the project.

Example Process

We propose an example process for creating population-based "brain atlases" from the fMRI Data Center's archive of high resolution anatomical MR data, using a multi-stage I/O intensive pipeline that warps the anatomical features of the brain of multiple subjects into a standard space. The process is shown below.

BrainAtlas.png

It is comprised of procedures, shown as orange ovals, and data items flowing between them, shown as trapeziums. It can be seen as five stages, where each stage is depicted as a horizontal row of the same procedure in the figure. The procedures employ the AIR (automated image registration) suite to create an averaged brain from a collection of high resolution anatomical data. In addition to the data items shown in the figure, there are other inputs to procedures (constant string options): we will make these evident in the workflow in the future.

The inputs to a workflow are a set of new brain images (3.a to 5.a) and a single reference brain image (6.a). All input images are 3D scans of a brain of varying resolutions, so that different features are evident. For each image, there is the actual image (.i) and the header/metadata information for that image (.h). The stages of the workflow do as follows.

  1. For each new brain image, it is compared to the reference image to determine how the new image should be warped, i.e. the position and shape of the image adjusted, to match the reference brain. The output of each procedure in the stage is a warp parameter set defining the warp to be done.
  2. For each warp parameter set, the actual transformation is done by creating a new version of the original new brain image with the configuration defined in the warp parameter set. The output is a warped image.
  3. All the warped images are averaged into one single image.
  4. For each dimension (x, y and z), the averaged image is sliced to give a 2D atlas along a plane in that dimension, taken through the centre of the 3D image. The output is an atlas data set.
  5. For each atlas data set, it is converted into a graphical atlas image.

Core Provenance Queries

An initial set of provenance-related queries is given below. These are intended only as a starting point for discussion to find the core provenance questions for the challenge.

  • Find the process that led to a graphical atlas image / everything that caused the graphical atlas image to be as it is. This should tell us the new brain images from which the averaged atlas was generated, the warping performed etc.
  • Find all calls to procedure alignwarp, and their runtimes, with argument model=rigid that ran in less than 30 minutes on non-ia64 processors. The argument is given as a string option to the alignwarp procedure.
  • Find the average runtime of all alignwarp calls with argument model=rigid that ran in less than 30 minutes.
  • Find all procedure calls within runs of the workflow named ‘/prod/2005/0305/prep’, a version of the brain atlas workflow given above, whose inputs were linearly aligned with model=affine
  • Find all output averaged images of softmean (average) procedures, where the warped images taken as input were align-warped with model=affine (i.e., “where softmean was preceded in the workflow, directly or indirectly, by an alignwarp procedure with argument model=affine”)
  • Find all outputs of softmean that were resliced with intensify=3
  • Find the outputs that were linearly aligned with model=affine and with input LFN metadata center=UChicago
  • Find all the graphical atlas sets that have metadata annotation studyModality with values speech, visual or audio, and return all the annotation tags of this set of data
  • Find all the metadata tags study on actual results of softmean that were linearly aligned with model= affine, and whose outputs have annotation state = IL

Timetable

  • 2006-May-30: First full draft of challenge, asking potential participants to help finalise
    • Fix provenance challenge proposal TWiki page, with aims, objectives, workflow, sample queries, timetable
    • Upload input, output, intermediate data used in workflow
    • Send provenance challenge call to other possible participants, including:
      • Aims, objectives, workflow, sample queries, timetable
      • Request for comments on the above
      • Request to suggest what should be core queries
  • 2006-June-15: Challenge finalised, participants start!
  • 2006-September: Deadline for challenge results to be uploaded
  • 2006-September: Face-to-face meeting at which results are discussed
  • 2006-October-15: Comparisons performed, minutes of discussion, proposed next steps uploaded

Other Challenges


Deleted:
<
<
Other challenges:

http://sws-challenge.org/wiki/index.php/Main_Page

This is a Music-IR contest that I am involved in.

Line: 12 to 78

http://www.music-ir.org/evaluation/ http://www.music-ir.org/mirexwiki/index.php/Main_Page
Deleted:
<
<

http://fling-l.seas.upenn.edu/~plclub/cgi-bin/poplmark/
Added:
>
>
META FILEATTACHMENT BrainAtlas?.png attr="" comment="Brain Atlas workflow" date="1147791744" path="BrainAtlas.png" size="5220" user="SimonMiles" version="1.1"
 <<O>>  Difference Topic WebHome (r1.6 - 12 May 2006 - LucMoreau)
Deleted:
<
<

Welcome to the home of TWiki.Challenge. This is a web-based collaboration area for the Provenance Challenge. This Web is public, but to edit, you need to be registered.

Added:
>
>

Other challenges: http://sws-challenge.org/wiki/index.php/Main_Page

This is a Music-IR contest that I am involved in.

http://www.music-ir.org/evaluation/ http://www.music-ir.org/mirexwiki/index.php/Main_Page

http://fling-l.seas.upenn.edu/~plclub/cgi-bin/poplmark/

 <<O>>  Difference Topic WebHome (r1.5 - 10 May 2006 - LucMoreau)
Deleted:
<
<
Welcome to the home of TWiki.Challenge. This is a web-based collaboration area for ...

Deleted:
<
<
  • MeetingMinutes?

Deleted:
<
<

Site Tools of the Challenge Web


Changed:
<
<

>
>
Welcome to the home of TWiki.Challenge. This is a web-based collaboration area for the Provenance Challenge. This Web is public, but to edit, you need to be registered.

Deleted:
<
<
Notes:

  • You are currently in the Challenge web. The color code for this web is this background, so you know where you are.
  • If you are not familiar with the TWiki collaboration platform, please visit WelcomeGuest first.

Web TWiki Site Map Use to...
Main Home of Main web Search Main web Recent changes in the Main web Get notified of changes to the Main web Welcome to TWiki... Users, Groups, Offices - tour this expandable virtual workspace. ...get a first-hand feel for TWiki possibilities.
TWiki Home of TWiki web Search TWiki web Recent changes in the TWiki web Get notified of changes to the TWiki web Welcome, Registration, and other StartingPoints; TWiki history & Wiki style; All the docs... ...discover TWiki details, and how to start your own site.
Challenge Home of Challenge web Search Challenge web Recent changes in the Challenge web Get notified of changes to the Challenge web Provenance Challenge ...collaborate on the Provenance Challeng
OPM Home of OPM web Search OPM web Recent changes in the OPM web Get notified of changes to the OPM web Open Provenance Model ...collaborate on the Open Provenance Model
Provenance Home of Provenance web Search Provenance web Recent changes in the Provenance web Get notified of changes to the Provenance web A public TWiki for the EU Provenance project ...collaborate on the EU Provenance project
RPIPC3 Home of RPIPC3 web Search RPIPC3 web Recent changes in the RPIPC3 web Get notified of changes to the RPIPC3 web RPI Team for Third Provenance Challenge describe team activities
Sandbox? Home of Sandbox web? Search Sandbox web Recent changes in the Sandbox web Get notified of changes to the Sandbox web Sandbox test area with all features enabled. ...experiment in an unrestricted hands-on web.
Soca Home of Soca web Search Soca web Recent changes in the Soca web Get notified of changes to the Soca web Sister Project: Southampton Chicago Activity ...collaborate on Sister Project: Southampton Chicago Activity
SotonProvenance Home of SotonProvenance web Search SotonProvenance web Recent changes in the SotonProvenance web Get notified of changes to the SotonProvenance web Southampton Provenance page ...collaborate on
You can use color coding by web for identification and reference. This table is updated automatically based on WebPreferences settings of the individual webs. Contact L.Moreau@ecs.soton.ac.uk if you need a separate collaboration web for your team. See also AdminTools.
Legend of icons:   Home of web = Go to the home of the web
Search web = Search the web
  Recent changes in the web = See recent changes in the web
Get notified of changes to the web = Subscribe to get notified of changes by e-mail
 <<O>>  Difference Topic WebHome (r1.4 - 14 Apr 2002 - PeterThoeny)

Welcome to the home of TWiki.Challenge. This is a web-based collaboration area for ...

  • MeetingMinutes?
Changed:
<
<

Maintenance of the Challenge web

  •    (More options in WebSearch)
  • WebChanges: Find out recent modifications to the TWiki.Challenge web.
  • WebIndex: Display all TWiki.Challenge topics in alphabetical order. See also the faster WebTopicList
  • WebNotify: Subscribe to be automatically notified when something changes in the TWiki.Challenge web.
  • WebStatistics: View access statistics of the TWiki.Challenge web.
  • WebPreferences: Preferences of the TWiki.Challenge web.
>
>

Site Tools of the Challenge Web


Notes:

Changed:
<
<
  • You are currently in the TWiki.Challenge web. The color code for this web is a (SPECIFY COLOR) background, so you know where you are.
  • If you are not familiar with the TWiki collaboration tool, please visit WelcomeGuest in the TWiki.TWiki web first.
>
>

  • You are currently in the Challenge web. The color code for this web is this background, so you know where you are.
  • If you are not familiar with the TWiki collaboration platform, please visit WelcomeGuest first.

Web TWiki Site Map Use to...
Main Home of Main web Search Main web Recent changes in the Main web Get notified of changes to the Main web Welcome to TWiki... Users, Groups, Offices - tour this expandable virtual workspace. ...get a first-hand feel for TWiki possibilities.
TWiki Home of TWiki web Search TWiki web Recent changes in the TWiki web Get notified of changes to the TWiki web Welcome, Registration, and other StartingPoints; TWiki history & Wiki style; All the docs... ...discover TWiki details, and how to start your own site.
Challenge Home of Challenge web Search Challenge web Recent changes in the Challenge web Get notified of changes to the Challenge web Provenance Challenge ...collaborate on the Provenance Challeng
OPM Home of OPM web Search OPM web Recent changes in the OPM web Get notified of changes to the OPM web Open Provenance Model ...collaborate on the Open Provenance Model
Provenance Home of Provenance web Search Provenance web Recent changes in the Provenance web Get notified of changes to the Provenance web A public TWiki for the EU Provenance project ...collaborate on the EU Provenance project
RPIPC3 Home of RPIPC3 web Search RPIPC3 web Recent changes in the RPIPC3 web Get notified of changes to the RPIPC3 web RPI Team for Third Provenance Challenge describe team activities
Sandbox? Home of Sandbox web? Search Sandbox web Recent changes in the Sandbox web Get notified of changes to the Sandbox web Sandbox test area with all features enabled. ...experiment in an unrestricted hands-on web.
Soca Home of Soca web Search Soca web Recent changes in the Soca web Get notified of changes to the Soca web Sister Project: Southampton Chicago Activity ...collaborate on Sister Project: Southampton Chicago Activity
SotonProvenance Home of SotonProvenance web Search SotonProvenance web Recent changes in the SotonProvenance web Get notified of changes to the SotonProvenance web Southampton Provenance page ...collaborate on
You can use color coding by web for identification and reference. This table is updated automatically based on WebPreferences settings of the individual webs. Contact L.Moreau@ecs.soton.ac.uk if you need a separate collaboration web for your team. See also AdminTools.
Legend of icons:   Home of web = Go to the home of the web
Search web = Search the web
  Recent changes in the web = See recent changes in the web
Get notified of changes to the web = Subscribe to get notified of changes by e-mail
 <<O>>  Difference Topic WebHome (r1.3 - 07 Apr 2002 - PeterThoeny)

Welcome to the home of TWiki.Challenge. This is a web-based collaboration area for ...

  • MeetingMinutes?
Line: 18 to 18

  • You are currently in the TWiki.Challenge web. The color code for this web is a (SPECIFY COLOR) background, so you know where you are.
  • If you are not familiar with the TWiki collaboration tool, please visit WelcomeGuest in the TWiki.TWiki web first.
Changed:
<
<

Web TWiki Site Map Use to...
Main Home of Main web Search Main web Recent changes in the Main web Get notified of changes to the Main web Welcome to TWiki... Users, Groups, Offices - tour this expandable virtual workspace. ...get a first-hand feel for TWiki possibilities.
TWiki Home of TWiki web Search TWiki web Recent changes in the TWiki web Get notified of changes to the TWiki web Welcome, Registration, and other StartingPoints; TWiki history & Wiki style; All the docs... ...discover TWiki details, and how to start your own site.
Challenge Home of Challenge web Search Challenge web Recent changes in the Challenge web Get notified of changes to the Challenge web Provenance Challenge ...collaborate on the Provenance Challeng
OPM Home of OPM web Search OPM web Recent changes in the OPM web Get notified of changes to the OPM web Open Provenance Model ...collaborate on the Open Provenance Model
Provenance Home of Provenance web Search Provenance web Recent changes in the Provenance web Get notified of changes to the Provenance web A public TWiki for the EU Provenance project ...collaborate on the EU Provenance project
RPIPC3 Home of RPIPC3 web Search RPIPC3 web Recent changes in the RPIPC3 web Get notified of changes to the RPIPC3 web RPI Team for Third Provenance Challenge describe team activities
Sandbox? Home of Sandbox web? Search Sandbox web Recent changes in the Sandbox web Get notified of changes to the Sandbox web Sandbox test area with all features enabled. ...experiment in an unrestricted hands-on web.
Soca Home of Soca web Search Soca web Recent changes in the Soca web Get notified of changes to the Soca web Sister Project: Southampton Chicago Activity ...collaborate on Sister Project: Southampton Chicago Activity
SotonProvenance Home of SotonProvenance web Search SotonProvenance web Recent changes in the SotonProvenance web Get notified of changes to the SotonProvenance web Southampton Provenance page ...collaborate on
You can use color coding by web for identification and reference. This table is updated automatically based on WebPreferences settings of the individual webs. Contact L.Moreau@ecs.soton.ac.uk if you need a separate collaboration web for your team. See also AdminTools.
Legend of icons:   Home of web = Go to the home of the web
Search web = Search the web
  Recent changes in the web = See recent changes in the web
Get notified of changes to the web = Subscribe to get notified of changes by e-mail
(included from SiteMap; change WebHome to include from SiteMap instead of TWikiWebsTable)
>
>

Web TWiki Site Map Use to...
Main Home of Main web Search Main web Recent changes in the Main web Get notified of changes to the Main web Welcome to TWiki... Users, Groups, Offices - tour this expandable virtual workspace. ...get a first-hand feel for TWiki possibilities.
TWiki Home of TWiki web Search TWiki web Recent changes in the TWiki web Get notified of changes to the TWiki web Welcome, Registration, and other StartingPoints; TWiki history & Wiki style; All the docs... ...discover TWiki details, and how to start your own site.
Challenge Home of Challenge web Search Challenge web Recent changes in the Challenge web Get notified of changes to the Challenge web Provenance Challenge ...collaborate on the Provenance Challeng
OPM Home of OPM web Search OPM web Recent changes in the OPM web Get notified of changes to the OPM web Open Provenance Model ...collaborate on the Open Provenance Model
Provenance Home of Provenance web Search Provenance web Recent changes in the Provenance web Get notified of changes to the Provenance web A public TWiki for the EU Provenance project ...collaborate on the EU Provenance project
RPIPC3 Home of RPIPC3 web Search RPIPC3 web Recent changes in the RPIPC3 web Get notified of changes to the RPIPC3 web RPI Team for Third Provenance Challenge describe team activities
Sandbox? Home of Sandbox web? Search Sandbox web Recent changes in the Sandbox web Get notified of changes to the Sandbox web Sandbox test area with all features enabled. ...experiment in an unrestricted hands-on web.
Soca Home of Soca web Search Soca web Recent changes in the Soca web Get notified of changes to the Soca web Sister Project: Southampton Chicago Activity ...collaborate on Sister Project: Southampton Chicago Activity
SotonProvenance Home of SotonProvenance web Search SotonProvenance web Recent changes in the SotonProvenance web Get notified of changes to the SotonProvenance web Southampton Provenance page ...collaborate on
You can use color coding by web for identification and reference. This table is updated automatically based on WebPreferences settings of the individual webs. Contact L.Moreau@ecs.soton.ac.uk if you need a separate collaboration web for your team. See also AdminTools.
Legend of icons:   Home of web = Go to the home of the web
Search web = Search the web
  Recent changes in the web = See recent changes in the web
Get notified of changes to the web = Subscribe to get notified of changes by e-mail
 <<O>>  Difference Topic WebHome (r1.2 - 24 Nov 2001 - PeterThoeny)

Welcome to the home of TWiki.Challenge. This is a web-based collaboration area for ...

  • MeetingMinutes?
Line: 8 to 8

  •    (More options in WebSearch)
  • WebChanges: Find out recent modifications to the TWiki.Challenge web.
Changed:
<
<
  • WebIndex: Display all TWiki.Challenge topics in alphabetical order.
>
>
  • WebIndex: Display all TWiki.Challenge topics in alphabetical order. See also the faster WebTopicList

  • WebNotify: Subscribe to be automatically notified when something changes in the TWiki.Challenge web.
  • WebStatistics: View access statistics of the TWiki.Challenge web.
  • WebPreferences: Preferences of the TWiki.Challenge web.
 <<O>>  Difference Topic WebHome (r1.1 - 08 Aug 2001 - PeterThoeny)
Line: 1 to 1
Added:
>
>
Welcome to the home of TWiki.Challenge. This is a web-based collaboration area for ...

  • MeetingMinutes?

Maintenance of the Challenge web

  •    (More options in WebSearch)
  • WebChanges: Find out recent modifications to the TWiki.Challenge web.
  • WebIndex: Display all TWiki.Challenge topics in alphabetical order.
  • WebNotify: Subscribe to be automatically notified when something changes in the TWiki.Challenge web.
  • WebStatistics: View access statistics of the TWiki.Challenge web.
  • WebPreferences: Preferences of the TWiki.Challenge web.

Notes:

  • You are currently in the TWiki.Challenge web. The color code for this web is a (SPECIFY COLOR) background, so you know where you are.
  • If you are not familiar with the TWiki collaboration tool, please visit WelcomeGuest in the TWiki.TWiki web first.

Web TWiki Site Map Use to...
Main Home of Main web Search Main web Recent changes in the Main web Get notified of changes to the Main web Welcome to TWiki... Users, Groups, Offices - tour this expandable virtual workspace. ...get a first-hand feel for TWiki possibilities.
TWiki Home of TWiki web Search TWiki web Recent changes in the TWiki web Get notified of changes to the TWiki web Welcome, Registration, and other StartingPoints; TWiki history & Wiki style; All the docs... ...discover TWiki details, and how to start your own site.
Challenge Home of Challenge web Search Challenge web Recent changes in the Challenge web Get notified of changes to the Challenge web Provenance Challenge ...collaborate on the Provenance Challeng
OPM Home of OPM web Search OPM web Recent changes in the OPM web Get notified of changes to the OPM web Open Provenance Model ...collaborate on the Open Provenance Model
Provenance Home of Provenance web Search Provenance web Recent changes in the Provenance web Get notified of changes to the Provenance web A public TWiki for the EU Provenance project ...collaborate on the EU Provenance project
RPIPC3 Home of RPIPC3 web Search RPIPC3 web Recent changes in the RPIPC3 web Get notified of changes to the RPIPC3 web RPI Team for Third Provenance Challenge describe team activities
Sandbox? Home of Sandbox web? Search Sandbox web Recent changes in the Sandbox web Get notified of changes to the Sandbox web Sandbox test area with all features enabled. ...experiment in an unrestricted hands-on web.
Soca Home of Soca web Search Soca web Recent changes in the Soca web Get notified of changes to the Soca web Sister Project: Southampton Chicago Activity ...collaborate on Sister Project: Southampton Chicago Activity
SotonProvenance Home of SotonProvenance web Search SotonProvenance web Recent changes in the SotonProvenance web Get notified of changes to the SotonProvenance web Southampton Provenance page ...collaborate on
You can use color coding by web for identification and reference. This table is updated automatically based on WebPreferences settings of the individual webs. Contact L.Moreau@ecs.soton.ac.uk if you need a separate collaboration web for your team. See also AdminTools.
Legend of icons:   Home of web = Go to the home of the web
Search web = Search the web
  Recent changes in the web = See recent changes in the web
Get notified of changes to the web = Subscribe to get notified of changes by e-mail
(included from SiteMap; change WebHome to include from SiteMap instead of TWikiWebsTable)
Revision r1.1 - 08 Aug 2001 - 05:20 - PeterThoeny
Revision r1.52 - 01 Dec 2009 - 16:09 - LucMoreau