Transactional writer support #6

Merged: 11 commits merged into master from transactional-writer-support on Mar 21, 2019
Conversation

anicolaspp (Owner)

No description provided.

@anicolaspp (Owner, Author)

@iulianov check all this when you get some free time.

- No synchronization was actually happening; we need a common context for the sync to happen (see the sketch below).
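A generic illustration of that point (not the PR's code, just a sketch of the idea): a lock that lives on each writer instance gives no mutual exclusion across instances; the monitor has to live in a context every writer shares.

// Each instance has its own monitor, so two instances never block each other.
class PerInstanceLock {
  def write(): Unit = this.synchronized {
    // ... not actually serialized across instances ...
  }
}

// A single shared monitor: every caller synchronizes on the same object,
// so the writes really are serialized.
object SharedContext {
  private val lock = new Object

  def write(): Unit = lock.synchronized {
    // ... serialized for all callers ...
  }
}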
@anicolaspp (Owner, Author)

@iulianov we no longer have to worry about doc-to-row and row-to-doc conversions; that is also done for us.


if (withTransaction) {
  df.write
    .format("com.github.anicolaspp.spark.sql.writing.Writer")
    .save(path)
} else {
  MapRSpark.save(df, path, "_id", false, false)
}
iulianov (Contributor) commented on this diff:
If we want the connector to replace the official connector at some point, then the createTable and bulkInsert options should be exposed to the caller, either through default parameters or through the .option method, similar to how the current connector allows df.write.option("Operation", "Insert").saveToMapRDB("path").
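A minimal sketch of the .option route, assuming the sink is implemented as a Spark DataSource V1 CreatableRelationProvider (the class name OptionsSketch and the createTable/bulkInsert option strings below are illustrative, not the connector's current API): everything the caller sets with .option arrives at the provider in the parameters map.

import org.apache.spark.sql.sources.{BaseRelation, CreatableRelationProvider}
import org.apache.spark.sql.{DataFrame, SQLContext, SaveMode}

// Illustrative provider, not the connector's actual Writer class.
class OptionsSketch extends CreatableRelationProvider {
  override def createRelation(sqlContext: SQLContext,
                              mode: SaveMode,
                              parameters: Map[String, String],
                              data: DataFrame): BaseRelation = {
    // Options set by the caller arrive here as strings; default to false when absent.
    val createTable = parameters.get("createTable").exists(_.toBoolean)
    val bulkInsert  = parameters.get("bulkInsert").exists(_.toBoolean)

    // ... perform the write using createTable / bulkInsert ...
    ???  // a real implementation would return a BaseRelation over the written table
  }
}

// Caller side, mirroring the existing save path:
// df.write
//   .format("com.github.anicolaspp.spark.sql.writing.Writer")
//   .option("createTable", "true")
//   .option("bulkInsert", "false")
//   .save(path)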

anicolaspp (Owner, Author) replied:
That is fine. I will create an issue to track that.
GH-9

@iulianov (Contributor)

Overall the logic looks good, but there are still edge cases that need to be thought through and either addressed in code or documented. You already have one case in the comments, but there are others.
One I can think of: a rollback is needed, but an error occurs on one of the executors that has not yet committed while it is running the MapRDBCleaner code. Depending on how the Spark job was started, it might not return an error to the user, yet it also will not have removed all the added records.
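A minimal sketch of one way to at least surface that situation, assuming the ids confirmed as written are available as an RDD[String] and the delete goes through a serializable function (the names rollback and deleteById are hypothetical; this is not the PR's MapRDBCleaner code): per-id failures are collected back on the driver and the job fails loudly instead of the cleanup dying silently on an executor.

import org.apache.spark.rdd.RDD

import scala.util.Try

// Hypothetical rollback helper: every id that could not be removed is reported
// back to the driver, and the job fails explicitly if the rollback is incomplete.
def rollback(writtenIds: RDD[String], deleteById: String => Unit): Unit = {
  val failures = writtenIds
    .mapPartitions { ids =>
      ids.flatMap(id => Try(deleteById(id)).failed.toOption.map(e => s"$id: ${e.getMessage}"))
    }
    .collect()

  if (failures.nonEmpty)
    throw new IllegalStateException(
      s"Rollback incomplete; ${failures.length} record(s) were not removed, e.g. ${failures.head}")
}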

@anicolaspp closed this on Mar 21, 2019
@anicolaspp reopened this on Mar 21, 2019

val store: DocumentStore = connection.getStore(table)

ids.foreach(store.delete)
iulianov (Contributor) commented on this diff:
In a normal rollback this should be fine, since only the confirmed written ids are in the ids value. But if one of the records to be deleted has already been removed by an external application, then store.delete should throw an exception; it might need to be caught and ignored here.
One way to avoid a try/catch here is to use the checkAndDelete method, although that might actually create more overhead.
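A minimal sketch of the caught-and-ignored variant, assuming the same OJAI DocumentStore as in the snippet above (the helper name deleteIgnoringMissing is hypothetical, and whether checkAndDelete would really be cheaper is something to measure, not assume):

import org.ojai.store.DocumentStore

import scala.util.Try

// Hypothetical helper: attempt every delete, swallow per-id failures (for example
// a record already removed by an external application), and report how many
// deletes did not go through so the caller can decide whether that matters.
def deleteIgnoringMissing(store: DocumentStore, ids: Seq[String]): Int =
  ids.count(id => Try(store.delete(id)).isFailure)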

@anicolaspp merged commit 46aa470 into master on Mar 21, 2019
@anicolaspp deleted the transactional-writer-support branch on March 21, 2019 at 03:07