SFTP does not delete processed files #2

slifty · 2019-07-05T17:02:05Z

Right now if someone uploads to SFTP then we do not actually delete or move it after processing. This means that over time the SFTP buckets will just get bigger and bigger for no reason.

There is some value in this, since the user can see that the file has been uploaded.

Lets find a way to improve this process so we don't have files duplicated for long periods of time after proper processing.

reefdog · 2019-07-05T17:05:22Z

Out of curiosity, what would happen if a user changed the name of a file that was already on the server (and had already otherwise been processed)? Would that trigger a new import, or would a de-duper notice it was the same already-processed file and ignore it?

isTravis · 2019-07-05T17:37:57Z

It seems that changing the metadata, key, etc of an s3 file creates a new copy of that file. We use the ObjectCreated event and this stackoverflow answer seems to suggest that ObjectCreated will be re-triggered. So, it seems that a rename would re-trigger processing.

That said, I believe the de-duper would catch it (the hash is generated on the file binary itself, not the s3 key name of the file).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SFTP does not delete processed files #2

SFTP does not delete processed files #2

slifty commented Jul 5, 2019

reefdog commented Jul 5, 2019

isTravis commented Jul 5, 2019

SFTP does not delete processed files #2

SFTP does not delete processed files #2

Comments

slifty commented Jul 5, 2019

reefdog commented Jul 5, 2019

isTravis commented Jul 5, 2019