|
1 | | -# Understanding and Running the DSC Workflow |
| 1 | +# Understanding and Running DSC |
2 | 2 |
|
3 | | -This documentation describes the DSC workflow and how to run the application. |
| 3 | +This documentation describes the DSC application and how to run it. |
4 | 4 |
|
5 | 5 | **DISCLAIMER**: While the CLI application is runnable on its own, the DSO Step Function offers a simplified user interface for running the full ETL pipeline. For more details on the DSO Step Function and how to use it, see https://mitlibraries.atlassian.net/wiki/spaces/IN/pages/4690542593/DSpace+Submission+Orchestrator+DSO. |
6 | 6 |
|
7 | | -## The DSC Workflow |
8 | | - |
9 | | -The DSC workflow consists of the following key steps: |
| 7 | +Running DSC consists of the following key steps: |
10 | 8 |
|
11 | 9 | 1. Create a batch |
12 | 10 | 2. Queue a batch for ingest |
13 | | -3. Items are ingested into DSpace by DSS* |
| 11 | +3. _Items are ingested into DSpace by DSS*_ |
14 | 12 | 4. Analyze ingest results |
15 | 13 |
|
16 | 14 | ***Important:** DSC is not responsible for ingesting items into DSpace nor does it execute this process. This task is handled by [DSS](https://github.com/MITLibraries/dspace-submission-service), which is invoked via the [DSO Step Function](https://github.com/MITLibraries/dspace-submission-service). |
17 | 15 |
|
18 | | -The DSC CLI provides commands for all other steps in the DSC workflow. |
19 | | - |
20 | 16 | ### Create a batch |
21 | | -DSC processes deposits in "batches", a collection of item submissions grouped by a unique identifier. DSC requires that the item submission assets (metadata and bitstream files) are uploaded to a "folder" in S3, named after the batch ID. While some requestors may upload the submission assets to S3 themselves, in other cases, these files need to be retrieved (via API requests) and uploaded during the batch creation step. |
| 17 | +DSC processes deposits in "batches", a collection of item submissions grouped by a unique identifier. Generally, assets for batches are provided in one of two ways: |
| 18 | + |
| 19 | +1. Requestors upload raw metadata and bitstreams (see [How To: Batch deposits with DSO for Content Owners](https://mitlibraries.atlassian.net/wiki/spaces/INF/pages/4411326470/How-To+Batch+deposits+with+DSpace+Submission+Orchestrator+DSO+for+Content+Owners)) |
| 20 | +2. DSC workflows retrieve raw metadata and/or bitstreams programmatically (via API requests) |
| 21 | + |
| 22 | +Once all assets are stored in a "folder" in S3, DSC verifies that metadata and bitstreams have been provided for each item submission. It is only after _all_ item submissions have been verified that DSC will establish the batch by recording each item in DynamoDB. |
22 | 23 |
|
23 | 24 | At the end of this step: |
24 | 25 | * If all item submission assets are complete: |
|
0 commit comments