Add support for ETL scenarios: dump a full ez installation, transform it, reimport it somewhere else #57

Open
gggeek opened this issue Aug 23, 2016 · 11 comments
gggeek commented Aug 23, 2016

In short: we could do a full eZ Publish database dump, descending the content tree breadth-first and making sure that user accounts are dumped first and contents second.

What would be missing is the handling of object relations: since in eZ Publish relations can be circular, the import should allow creating content even with broken object relations, and then do extra passes to fix those once all contents have been created.
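To make that idea concrete, here is a minimal, self-contained sketch of such a two-pass import. The contents are plain arrays and the create/attach steps are stand-ins for real repository calls; nothing here is bundle API:

```php
<?php
// Sketch of a two-pass import that tolerates circular object relations.

$dump = [
    ['remote_id' => 'a', 'relations' => ['b']],   // a -> b
    ['remote_id' => 'b', 'relations' => ['a']],   // b -> a (circular)
];

$created  = [];   // remote_id => content payload
$deferred = [];   // remote_id => relations to fix in pass 2

// Pass 1: create every content, skipping relations whose target does
// not exist yet (unavoidable when relations are circular).
foreach ($dump as $payload) {
    $missing = array_filter(
        $payload['relations'],
        fn ($target) => !isset($created[$target])
    );
    $created[$payload['remote_id']] = $payload; // stand-in for a create call
    if ($missing) {
        $deferred[$payload['remote_id']] = $missing;
    }
}

// Pass 2: all contents exist now, so the deferred relations can be
// attached safely.
foreach ($deferred as $remoteId => $relations) {
    foreach ($relations as $target) {
        printf("attach relation %s -> %s\n", $remoteId, $target);
    }
}
```

Since pass 1 guarantees every content exists, a single fix-up pass is enough here; if individual creations could fail and be retried, the fix-up would instead loop until no deferred relation remains.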

Note: this most likely depends on issues #34 and #46.

gggeek added this to the 4.0 milestone Aug 23, 2016

gggeek commented Oct 22, 2016

Prerequisite: #55


gggeek commented Oct 22, 2016

Prerequisite: #56


gggeek commented Oct 22, 2016

Prerequisite: #54


gggeek commented Oct 22, 2016

Prerequisite: #34


gggeek commented Oct 22, 2016

More prerequisites:

  • a migration loader that scans directories recursively, since 1M files cannot sit in a single directory (see the sketch after this list)
  • a 'migrate' command that imports contents in parallel processes, for speed
  • a command that drops the existing migration table, to ease testing / repeated executions (or that allows removing migrations by regexp matching on name or path)
  • a command that drops all contents except the top-level folders, the admin and anonymous users, their 3 sections, and the anonymous and admin roles, to clean up the target database (this could probably be achieved with existing migration steps, but for speed it is probably better to use custom SQL queries...)
  • a 'generate' migration which actually saves files to disk, to be used by the high-level 'export' command
  • a way to add settings that tune how migration definitions are created on content export
  • an 'export' command that splits its work across parallel threads
  • an 'upsert' migration for the cases where the target installation already has contents (Support upsert migrations #245)
  • a flexible way of matching contents between the source and target databases via id / remote-id mappers
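For the recursive loader (first bullet above), PHP's SPL iterators already do the heavy lifting. A minimal sketch, where the .yml extension filter is only an assumption about how definitions are named:

```php
<?php
// Sketch: collect migration definition files from a directory tree
// using SPL iterators, instead of a flat scandir().
function findMigrationFiles(string $root): array
{
    $files = [];
    $iterator = new RecursiveIteratorIterator(
        new RecursiveDirectoryIterator($root, FilesystemIterator::SKIP_DOTS)
    );
    foreach ($iterator as $file) {
        // Assumption: migration definitions are .yml files.
        if ($file->isFile() && $file->getExtension() === 'yml') {
            $files[] = $file->getPathname();
        }
    }
    sort($files); // keep execution order deterministic
    return $files;
}
```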


gggeek commented Mar 14, 2017

Prerequisite: #102


gggeek commented Nov 12, 2017

Steps forward in release 4.4


gggeek commented Nov 25, 2018

Steps forward in release 5.4.1 and 5.5


gggeek commented Nov 30, 2018

Steps forward in release 5.6


gggeek commented Dec 15, 2018

Bugfixes in 5.7.3


gggeek commented Nov 4, 2020

Step forward in release 5.13: upserts are now possible, albeit in an impractical way. Create a migration with 3 steps:

  1. load the target item by identifier, with allow_null_results enabled, and set a reference to the result count
  2. create the item, with an if condition: reference equals 0
  3. update the item, with an if condition: reference does not equal 0
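Spelled out as a migration definition, that recipe might look roughly as follows. This is a sketch only: the exact keys (expect, the count reference attribute, the if-condition syntax) are from memory and should be checked against the bundle docs for the version in use, and identifiers such as my_item are placeholders:

```yaml
-   # step 1: look for the item; do not fail when it is missing
    type: content
    mode: load
    match:
        content_remote_id: my_item
    expect: any            # i.e. allow null results
    references:
        -
            identifier: found
            attribute: count
-   # step 2: only runs when nothing was found
    type: content
    mode: create
    content_type: article
    parent_location: 2
    attributes:
        title: My item
    if:
        "reference:found":
            eq: 0
-   # step 3: only runs when the item already exists
    type: content
    mode: update
    match:
        content_remote_id: my_item
    attributes:
        title: My item
    if:
        "reference:found":
            ne: 0
```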

@blankse I know this is way late, but it probably is a stepping stone for simplified upsert steps
