You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CHANGELOG
+51-1Lines changed: 51 additions & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,56 @@
1
-
GOBBLIN 0.6.1
1
+
GOBBLIN 0.6.2
2
2
=============
3
3
4
+
## NEW FEATURES
5
+
* [Admin Dashboard] Added a web based GUI for exploring running and finished jobs in a running Gobblin daemon (thanks Eric Ogren).
6
+
* [Admin Dashboard] Added a CLI for finding jobs in the job history store and seeing their run details (thanks Eric Ogren).
7
+
* [Configuration Management] WIP: Configuration management library. Will enable Gobblin to be dataset aware, ie. to dynamically load and apply different configurations to each dataset in a single Gobblin job.
8
+
** APIs: APIs for configuration stores and configuration client.
9
+
** Configuration Library: loads low level configurations from a configuration store, resolves configuration dependencies / imports, and performs value interpolation.
10
+
* [Distcp] Allow using *.ready files as markers for files that should be copied, and deletion of *.ready files once the file has been copied.
11
+
* [Distcp] Added file filters to recursive copyable dataset for distcp. Allows to only copy files satisfying a filter under a base directory.
12
+
* [Distcp] Copied files that fail to be published are persisted for future runs. Future runs can recover the already copied file instead of re-doing the byte transfer.
13
+
* [JDBC] Can use password encryption for JDBC sources.
14
+
* [YARN] Added email notifications on YARN application shutdown.
15
+
* [YARN] Added event notifications on YARN container status changes.
16
+
* [Metrics] Added metric filters based on name and type of the metrics.
17
+
* [Dataset Management] POC embedded sql for config-driven retention management.
18
+
* [Exactly Once] POC for Gobblin managed exactly once semantics on publisher.
19
+
20
+
## BUG FIXES
21
+
* **Core** File based source includes previously failed WorkUnits event if there are no new files in the source (thanks Joel Baranick).
22
+
* **Core** Ensure that output file list does not contain duplicates due to task retries (thanks Joel Baranick).
23
+
* **Core** Fix NPE in CliOptions.
24
+
* **Core/YARN** Limit Props -> Typesafe Config conversion to a few keys to prevent overwriting of certain properties.
25
+
* **Utility** Fixed writer mkdirs for S3.
26
+
* **Metrics** Made Scheduled Reporter threads into daemon threads to prevent hanging application.
27
+
* **Metrics** Fixed enqueuing of events on event reporters that was causing job failure if event frequency was too high.
28
+
* **Build** Fix POM dependencies on gobblin-rest-api.
29
+
* **Build** Added conjars and cloudera repository to all projects (fixes builds for certain users).
30
+
* **Build** Fix the distribution tarball creation (thanks Joel Baranick).
31
+
* **Build** Added option to exclude Hadoop and Hive jars from distribution tarball.
32
+
* **Build** Removed log4j.properties from runtime resources.
33
+
* **Compaction** Fixed main class in compaction manifest file (thanks Lorand Bendig).
34
+
* **JDBC** Correctly close JDBC connections.
35
+
36
+
## IMPROVEMENTS
37
+
* [Build] Add support for publishing libraries to maven local (thanks Joel Baranick).
38
+
* [Build] In preparation to Gradle 2 migration, added ext. prefix to custom gradle properties.
39
+
* [Build] Can generate project dependencies graph in dot format.
40
+
* [Metrics] Migrated Kafka reporter and Output stream reporter to Root Metrics Reporter managed reporting.
41
+
* [Metrics] The last metric emission in the application has a "final" tag for easier Hive identification.
42
+
* [Metrics] Metrics for Gobblin on YARN include cluster tags.
43
+
* [Hive] Upgraded Hive to version 1.0.1.
44
+
* [Distcp] Add file size to distcp success notifications.
45
+
* [Distcp] Each work unit in distcp contains exactly one Copyable File.
46
+
* [Distcp] Copy source can set upstream timestamps for SLA events emitted on publish time.
47
+
* [Scheduling] Added Gobblin Oozie config files.
48
+
* [Documentation] Improved javadocs.
49
+
50
+
51
+
GOBBLIN 0.6.1
52
+
-------------
53
+
4
54
## BUG FIXES
5
55
6
56
- **Build/release** Adding build instrumentation for generation of rest-api-* artifacts
0 commit comments