Third Provenance Challenge | ||||||||
| Line: 7 to 7 | ||||||||
|---|---|---|---|---|---|---|---|---|
Current Status | ||||||||
| Changed: | ||||||||
| < < |
| |||||||
| > > |
||||||||
| Changed: | ||||||||
| < < |
| |||||||
| > > |
Workshop Details | |||||||
| Added: | ||||||||
| > > |
| |||||||
Participating Teams | ||||||||
Third Provenance Challenge | ||||||||
| Line: 7 to 7 | ||||||||
|---|---|---|---|---|---|---|---|---|
Current Status | ||||||||
| Changed: | ||||||||
| < < |
Teams should be implementing the workflow below, running the core queries and starting to export OPM formatted data. | |||||||
| > > |
||||||||
| Changed: | ||||||||
| < < |
Workshop details can be found at LocalDetailsPC3. | |||||||
| > > |
| |||||||
Participating Teams | ||||||||
Third Provenance Challenge | ||||||||
| Line: 7 to 7 | ||||||||
|---|---|---|---|---|---|---|---|---|
Current Status | ||||||||
| Changed: | ||||||||
| < < |
Teams should be implementing the workflow below, running the core queries and starting to export OPM formatted data. Workshop details can be found at LocalDetailsPC3. | |||||||
| > > |
Teams should be implementing the workflow below, running the core queries and starting to export OPM formatted data. Workshop details can be found at LocalDetailsPC3. | |||||||
Participating Teams | ||||||||
Third Provenance Challenge | ||||||||
| Line: 38 to 38 | ||||||||
|---|---|---|---|---|---|---|---|---|
| 4. Export OPM Graphs and import from others [Apr 13 - May 4] | ||||||||
| Changed: | ||||||||
| < < |
5. Run queries on imported OPM graph [Apr 27 - Jun 1] | |||||||
| > > |
5. Run queries on imported OPM graph [May 4 - Jun 1] | |||||||
| 6. Prepare slides for challenge [Jun 1 - Jun 8] | ||||||||
Third Provenance Challenge | ||||||||
| Line: 7 to 7 | ||||||||
|---|---|---|---|---|---|---|---|---|
Current Status | ||||||||
| Changed: | ||||||||
| < < |
Teams should be implementing the workflow below, running the core queries and starting to export OPM formatted data. | |||||||
| > > |
Teams should be implementing the workflow below, running the core queries and starting to export OPM formatted data. Workshop details can be found at LocalDetailsPC3. | |||||||
Participating Teams | ||||||||
Third Provenance Challenge | ||||||||
| Line: 5 to 5 | ||||||||
|---|---|---|---|---|---|---|---|---|
| The top-level page for the third provenance challenge.
| Deleted: | ||||||||
| < < |
||||||||
Current Status | ||||||||
| Changed: | ||||||||
| < < |
The Challenge has started. Teams should be implementing the workflow below. We are still looking for suggested queries. Participating teams should set up a page to document their results as outlined in the next section. | |||||||
| > > |
Teams should be implementing the workflow below, running the core queries and starting to export OPM formatted data. | |||||||
Participating Teams
Pages for each participating team can be found at the ParticipatingTeams3 page. If you are participating, please create a link to your team's page there. You can use the Test Team page as a template for what should be included in a team page.
Sponsors
| Changed: | ||||||||
| < < |
Thanks to our sponsor, the Virtual Laboratory for e-Science. | |||||||
| > > |
Thanks to our sponsors, the Virtual Laboratory for e-Science and Microsoft.
| Changed: | ||||||||
| < < |
| |||||||
| > > |
| |||||||
Schedule | ||||||||
| Line: 75 to 81 | ||||||||
| ||||||||
| Added: | ||||||||
| > > |
| |||||||
Third Provenance Challenge | ||||||||
| Line: 13 to 13 | ||||||||
|---|---|---|---|---|---|---|---|---|
Participating Teams | ||||||||
| Changed: | ||||||||
| < < |
Pages for each participating team can be found at the ParticipatingTeams3 page. If you are participating, please create a link to your team's page there. You can use the USC/ISI page as a template for what should be included in a team page.
| > > |
Pages for each participating team can be found at the ParticipatingTeams3 page. If you are participating, please create a link to your team's page there. You can use the Test Team page as a template for what should be included in a team page.
Sponsors
Thanks to our sponsor, the Virtual Laboratory for e-Science.
Third Provenance Challenge | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Line: 56 to 56 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Provenance Challenge Workflow | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Changed: | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| < < |
[Figure: PC3 workflow diagram, available as PNG, PDF, and PPTX]
| > > |
The PC3 workflow and its software implementation in .Net, Java, and shell scripts can be found at the ThirdPCWorkflow page. Below is the background of the workflow. | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Background
The Pan-STARRS project is building and operating the next-generation sky survey, which can continuously scan the visible sky once a week and build a time series of data. Besides building a massive catalog of the solar system and 99% of the visible stars in the northern hemisphere, this helps detect moving objects that may impact Earth. The collaboration is led by the University of Hawai'i, which operates the telescope and image pipeline, while Johns Hopkins University is building the object data management (ODM) framework that is exposed to astronomers. The load workflow used in PC3 sits at the handoff between the image pipeline and the ODM, and uses the Trident workbench to ingest incoming CSV files into SQL Server databases.
| Line: 68 to 69 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Alex Szalay (Johns Hopkins University) | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Deleted: | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| < < |
Workflow | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Changed: | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| < < |
Activities
Provenance Challenge Software Setup
  Overview
  Flavors
    C# .NET + SQL Server Flavor (Windows Platform)
      Pre-requisites
      Folder Organization
      Setup/Compile/Build/Run
      Batch Scripting/Executable Version
    Java + Derby Flavor (All Platforms)
      Pre-requisites
      Folder Organization
      Setup/Compile/Build/Run
      Shell Scripting/Executable Version
  Frequently Asked Questions
    General
    C# .NET
    SQL Server
    Java
    Derby
  Downloads
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| > > |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Third Provenance Challenge | |||||||||||||||||||||||
| Line: 12 to 15 | |||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pages for each participating team can be found at the ParticipatingTeams3 page. If you are participating, please create a link to your team's page there. You can use the USC/ISI page as a template for what should be included in a team page.
| Added: | |||||||||||||||||||||||
| > > |
Sponsors
Thanks to our sponsor, the Virtual Laboratory for e-Science.
Schedule
1. Review of code and provenance query proposals (to Feb 27)
| Line: 207 to 215 | |||||||||||||||||||||||
| -- YogeshSimmhan - 04 Feb 2009 | |||||||||||||||||||||||
| Added: | |||||||||||||||||||||||
| > > |
|||||||||||||||||||||||
| |||||||||||||||||||||||
| Added: | |||||||||||||||||||||||
| > > |
| ||||||||||||||||||||||
Third Provenance Challenge | ||||||||
| Line: 52 to 52 | ||||||||
|---|---|---|---|---|---|---|---|---|
[PNG] | [PDF] | [PPTX]
Background | ||||||||
| Added: | ||||||||
| > > |
The Pan-STARRS project is building and operating the next-generation sky survey, which can continuously scan the visible sky once a week and build a time series of data. Besides building a massive catalog of the solar system and 99% of the visible stars in the northern hemisphere, this helps detect moving objects that may impact Earth. The collaboration is led by the University of Hawai'i, which operates the telescope and image pipeline, while Johns Hopkins University is building the object data management (ODM) framework that is exposed to astronomers. The load workflow used in PC3 sits at the handoff between the image pipeline and the ODM, and uses the Trident workbench to ingest incoming CSV files into SQL Server databases.
Acknowledgement
Jim Heasley (University of Hawai'i)
Alex Szalay (Johns Hopkins University)
Workflow | ||||||||
Third Provenance Challenge | ||||||||
| Line: 44 to 44 | ||||||||
|---|---|---|---|---|---|---|---|---|
Provenance Questions | ||||||||
| Changed: | ||||||||
| < < |
Please list possible provenance queries for the Challenge here. If the query requires any additions to the workflow, please detail them as well.
Suggested Query 1
For a given detection, which CSV files contributed to it?
Basic sample answer: The CSV file containing the Detection table.
Advanced sample answer: The CSV file containing the Detection table, the CSV file containing the Image table (as the image is an attribute of the detection), and the CSV file containing the FrameMetadata table (as the frame metadata is an attribute of the image).
Suggested Query 2
A CSV or header file is deleted during the workflow's execution. How much time elapsed between a successful IsMatchCSVFileTables test (when the file existed) and an unsuccessful IsExistsCSVFile test (when the file had been deleted)?
Sample answer: 3ms
For testing the above query, it may be simplest to edit the workflow to include deletion of the CSV file as a step.
Suggested Query 3
The user considers a table to contain values they do not expect. Was the range check (IsMatchTableColumnRanges) performed for this table?
Sample answer: Yes
Suggested Query 4
The workflow halts due to failing an IsMatchTableColumnRanges check. How many tables were successfully loaded before the workflow halted due to the failed check?
Sample answer: 2
Suggested Query 5
Which operation executions were strictly necessary for the Image table to contain a particular (non-computed) value?
Sample answer: call of ReadCSVReadyFile, call of CreateEmptyLoadDB, 2nd call of ReadCSVFileColumnNames, 2nd call of LoadCSVFileIntoTable (2nd calls because Image is loaded in the 2nd iteration of the for loop; checks are excluded because they do not change anything; UpdatedComputedColumns is excluded because the value is non-computed; CompactDatabase is excluded because it does not affect the value).
Suggested Query 6
Which pairs of procedures in the workflow could be swapped and the same result still be obtained (given the particular data input)?
Sample answer: (I won't enumerate them all, but I think some can be swapped, as the checks in particular are not causally dependent; however, we cannot swap those inside the loop with those outside.)
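Suggested Query 2 asks for the time between two process executions. A minimal sketch of how such a query might be answered, assuming each team's system can export timestamped execution records: the record layout, the file name, and the timestamps below are hypothetical and not part of the challenge materials; only the two process names come from the query text.

```python
from datetime import datetime, timedelta

# Hypothetical execution records (assumed layout, not a PC3 or OPM format).
LOG = [
    {"process": "IsMatchCSVFileTables", "file": "image.csv", "ok": True,
     "ended": "2009-03-02T10:00:00.000"},
    {"process": "IsExistsCSVFile", "file": "image.csv", "ok": False,
     "ended": "2009-03-02T10:00:00.003"},
]


def elapsed_ms(log, file_name):
    """Milliseconds between the latest successful IsMatchCSVFileTables and the
    first failed IsExistsCSVFile that follows it, or None if no such pair."""
    last_ok = None
    for rec in sorted(log, key=lambda r: datetime.fromisoformat(r["ended"])):
        if rec["file"] != file_name:
            continue
        when = datetime.fromisoformat(rec["ended"])
        if rec["process"] == "IsMatchCSVFileTables" and rec["ok"]:
            last_ok = when
        elif (rec["process"] == "IsExistsCSVFile" and not rec["ok"]
              and last_ok is not None):
            # Integer milliseconds, avoiding floating-point rounding.
            return (when - last_ok) // timedelta(milliseconds=1)
    return None


print(elapsed_ms(LOG, "image.csv"))
```

With the hypothetical records above, the two checks are 3 milliseconds apart, matching the shape of the sample answer.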
| > > |
Please list possible provenance queries for the Challenge here. If the query requires any additions to the workflow please detail them as well. | |||||||
Provenance Challenge Workflow | ||||||||
Third Provenance Challenge | ||||||||
| Line: 6 to 6 | ||||||||
|---|---|---|---|---|---|---|---|---|
Current Status | ||||||||
| Changed: | ||||||||
| < < |
As a result of a meeting at e-Science 2008, we have selected the Pan-STARRS workflow from the ThirdProvenanceChallengeWorkflowProposals.
| > > |
The Challenge has started. Teams should be implementing the workflow below. We are still looking for suggested queries. Participating teams should set up a page to document their results as outlined in the next section. | |||||||
| Changed: | ||||||||
| < < |
We are currently finalizing the Pan-STARRS workflow code and documentation for the challenge. You can find the code and documentation below.
| > > |
Participating Teams | |||||||
| Changed: | ||||||||
| < < |
We are also putting together provenance queries to be used during the challenge. Please put your suggestions below.
| > > |
Pages for each participating team can be found at the ParticipatingTeams3 page. If you are participating, please create a link to your team's page there. You can use the USC/ISI page as a template for what should be included in a team page.
Schedule | ||||||||
| Line: 30 to 30 | ||||||||
| PC3 Workshop June 10 - 11 held in Amsterdam | ||||||||
| Added: | ||||||||
| > > |
Challenge Goals
1. Identify weaknesses and strengths of the OPM specification.
2. Encourage the development of concrete bindings for OPM in a variety of languages.
3. Determine how well OPM can represent provenance for a variety of technologies (scientific workflow, databases, etc.).
4. Demonstrate that a complex data product's provenance can be constructed from provenance documentation produced by multiple combinations of heterogeneous applications.
5. Bring together the community to further discuss the interoperability of provenance systems.
Provenance Questions
Please list possible provenance queries for the Challenge here. If the query requires any additions to the workflow, please detail them as well.
Third Provenance Challenge | ||||||||
| Line: 34 to 34 | ||||||||
|---|---|---|---|---|---|---|---|---|
| Please list possible provenance queries for the Challenge here. If the query requires any additions to the workflow please detail them as well. | ||||||||
| Added: | ||||||||
| > > |
Suggested Query 1
For a given detection, which CSV files contributed to it?
Basic sample answer: The CSV file containing the Detection table.
Advanced sample answer: The CSV file containing the Detection table, the CSV file containing the Image table (as the image is an attribute of the detection), and the CSV file containing the FrameMetadata table (as the frame metadata is an attribute of the image).
Suggested Query 2
A CSV or header file is deleted during the workflow's execution. How much time elapsed between a successful IsMatchCSVFileTables test (when the file existed) and an unsuccessful IsExistsCSVFile test (when the file had been deleted)?
Sample answer: 3ms
For testing the above query, it may be simplest to edit the workflow to include deletion of the CSV file as a step.
Suggested Query 3
The user considers a table to contain values they do not expect. Was the range check (IsMatchTableColumnRanges) performed for this table?
Sample answer: Yes
Suggested Query 4
The workflow halts due to failing an IsMatchTableColumnRanges check. How many tables were successfully loaded before the workflow halted due to the failed check?
Sample answer: 2
Suggested Query 5
Which operation executions were strictly necessary for the Image table to contain a particular (non-computed) value?
Sample answer: call of ReadCSVReadyFile, call of CreateEmptyLoadDB, 2nd call of ReadCSVFileColumnNames, 2nd call of LoadCSVFileIntoTable (2nd calls because Image is loaded in the 2nd iteration of the for loop; checks are excluded because they do not change anything; UpdatedComputedColumns is excluded because the value is non-computed; CompactDatabase is excluded because it does not affect the value).
Suggested Query 6
Which pairs of procedures in the workflow could be swapped and the same result still be obtained (given the particular data input)?
Sample answer: (I won't enumerate them all, but I think some can be swapped, as the checks in particular are not causally dependent; however, we cannot swap those inside the loop with those outside.)
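Suggested Query 1 can be read as an ancestor traversal over a causal graph. The sketch below assumes provenance has been exported as (effect, relation, cause) triples using OPM-style relation names; the node identifiers, file names, and the triple layout are hypothetical illustrations, not the format any team is required to produce.

```python
from collections import defaultdict

# Hypothetical OPM-style edges: (effect, relation, cause).
EDGES = [
    ("row:Detection/42", "wasGeneratedBy", "proc:LoadCSVFileIntoTable(Detection)"),
    ("proc:LoadCSVFileIntoTable(Detection)", "used", "file:detection.csv"),
    ("row:Detection/42", "wasDerivedFrom", "row:Image/7"),
    ("row:Image/7", "wasGeneratedBy", "proc:LoadCSVFileIntoTable(Image)"),
    ("proc:LoadCSVFileIntoTable(Image)", "used", "file:image.csv"),
    ("row:Image/7", "wasDerivedFrom", "row:FrameMeta/3"),
    ("row:FrameMeta/3", "wasGeneratedBy", "proc:LoadCSVFileIntoTable(FrameMeta)"),
    ("proc:LoadCSVFileIntoTable(FrameMeta)", "used", "file:frame_meta.csv"),
]


def contributing_csv_files(artifact, edges):
    """Return every CSV file reachable by following causes back from artifact."""
    causes = defaultdict(set)
    for effect, _relation, cause in edges:
        causes[effect].add(cause)

    seen, stack = set(), [artifact]
    while stack:  # iterative depth-first walk over the causal edges
        node = stack.pop()
        for cause in causes[node]:
            if cause not in seen:
                seen.add(cause)
                stack.append(cause)
    return sorted(n for n in seen if n.startswith("file:") and n.endswith(".csv"))


print(contributing_csv_files("row:Detection/42", EDGES))
```

In this toy graph, the row-to-row wasDerivedFrom edges are what turn the basic answer into the advanced one: without them, only the Detection CSV file is reachable from the detection.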
Provenance Challenge Workflow
Third Provenance Challenge | |||||||||||||
| Line: 36 to 36 | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Provenance Challenge Workflow | |||||||||||||
| Added: | |||||||||||||
| > > |
[Figure: PC3 workflow diagram, available as PNG, PDF, and PPTX]
Background | |||||||||||||
| Line: 176 to 180 | |||||||||||||
| |||||||||||||
| Added: | |||||||||||||
| > > |
| ||||||||||||
-- YogeshSimmhan - 04 Feb 2009
| |||||||||||||
| Added: | |||||||||||||
| > > |
| ||||||||||||
Third Provenance Challenge | |||||||||||||||||||||||||||||||||||||||||||||||||||||
| Line: 42 to 42 | |||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Workflow
Activities
| Added: | |||||||||||||||||||||||||||||||||||||||||||||||||||||
| > > |
| ||||||||||||||||||||||||||||||||||||||||||||||||||||
| Added: | |||||||||||||||||||||||||||||||||||||||||||||||||||||
| > > |
-- YogeshSimmhan - 17 Feb 2009 | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Provenance Challenge Software Setup
Overview
Third Provenance Challenge | ||||||||
| Line: 8 to 8 | ||||||||
|---|---|---|---|---|---|---|---|---|
| As a result of a meeting at e-Science 2008, we have selected the Pan-STARRS workflow from the ThirdProvenanceChallengeWorkflowProposals.
| Changed: | ||||||||
| < < |
Current Tasks (End of January 2009): | |||||||
| > > |
We are currently finalizing the Pan-STARRS workflow code and documentation for the challenge. You can find the code and documentation below.
| Changed: | ||||||||
| < < |
| |||||||
| > > |
We are also putting together provenance queries to be used during the challenge. Please put your suggestions below.
Schedule
1. Review of code and provenance query proposals (to Feb 27)
March 2 - PC3 Starts
2. Make the workflow work with each team's systems [Mar 2 - Mar 30]
3. Generate provenance for the challenge workflow & run queries on it [Mar 30 - Apr 13]
4. Export OPM Graphs and import from others [Apr 13 - May 4]
5. Run queries on imported OPM graph [Apr 27 - Jun 1]
6. Prepare slides for challenge [Jun 1 - Jun 8]
PC3 Workshop June 10 - 11 held in Amsterdam
Provenance Questions | ||||||||
Third Provenance Challenge | |||||||||||
| Line: 19 to 19 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Please list possible provenance queries for the Challenge here. If the query requires any additions to the workflow please detail them as well. | |||||||||||
| Deleted: | |||||||||||
| < < |
-- PaulGroth - 15 Dec 2008 | ||||||||||
| Added: | |||||||||||
| > > |
Provenance Challenge Workflow
  Background
  Workflow
  Activities
Provenance Challenge Software Setup
  Overview
  Flavors
    C# .NET + SQL Server Flavor (Windows Platform)
      Pre-requisites
      Folder Organization
      Setup/Compile/Build/Run
      Batch Scripting/Executable Version
    Java + Derby Flavor (All Platforms)
      Pre-requisites
      Folder Organization
      Setup/Compile/Build/Run
      Shell Scripting/Executable Version
  Frequently Asked Questions
    General
    C# .NET
    SQL Server
    Java
    Derby
  Downloads
| ||||||||||
Third Provenance Challenge
The top-level page for the third provenance challenge.
| Changed: | ||||||||
| < < |
We are currently in the process of identifying workflows: see ThirdProvenanceChallengeWorkflowProposals. | |||||||
| > > |
Current Status | |||||||
| Added: | ||||||||
| > > |
As a result of a meeting at e-Science 2008, we have selected the Pan-STARRS workflow from the ThirdProvenanceChallengeWorkflowProposals.
| Changed: | ||||||||
| < < |
-- LucMoreau - 25 Nov 2008 | |||||||
| > > |
Current Tasks (End of January 2009):
Provenance Questions
Please list possible provenance queries for the Challenge here. If the query requires any additions to the workflow, please detail them as well.
-- PaulGroth - 15 Dec 2008
| Line: 1 to 1 | ||||||||
|---|---|---|---|---|---|---|---|---|
| Added: | ||||||||
| > > |
Third Provenance Challenge
The top-level page for the third provenance challenge. We are currently in the process of identifying workflows: see ThirdProvenanceChallengeWorkflowProposals.
-- LucMoreau - 25 Nov 2008