Skip to content

Latest commit

 

History

History
192 lines (155 loc) · 10.1 KB

CONTRIBUTORS.md

File metadata and controls

192 lines (155 loc) · 10.1 KB

Author

The LinkedIn Gradle Plugin for Apache Hadoop was created and designed by Alex Bain. The design of the LinkedIn Gradle DSL for Apache Hadoop was influenced by azkaban-rb from Matthew Hayes and with input from Will Vaughan.

Contributors

The following were contributed by Jin Sha. Thanks, Jin!

  • Added support for model auto-publish for KabootarJob

The following were contributed by Arpan Agrawal. Thanks, Arpan!

  • Added AutoTunePigLiJob job type support

The following were contributed by Jun "Tony" Zhou. Thanks, Tony!

  • Added Python Spark support for runSparkJob task.
  • Added support for submitting python Spark applications with Spark job type. Corresponsing tests are added.
  • Added application running mode check in Spark plugin. Spark job now run on YARN cluster unless explicitly specified otherwise.

The following were contributed by Jin Hyuk Chang. Thanks, Jin Hyuk!

  • DSS-5053: Added Hive ORC table as a source in hdfsToTeradata job type.
  • DSS-4743: Hadoop DSL for SQL job type. Adding encrypted password support on Teradata job type in Hadoop DSL.
  • DSS-3870: Hadoop DSL for new Azkaban job type. Teradata <-> HDFS, HDFS -> Espresso, and Gobblin

The following were contributed by Anthony Hsu. Thanks, Anthony!

  • LIHADOOP-16591: ligradle azkabanUpload fails with "ClassNotFoundException: org.apache.http.client.methods.HttpPost"

The following were contributed by Jonathan Hung. Thanks, Jonathan!

  • LIHADOOP-37013. Tensorflow execution type multiplexing between TF on Spark and TF on YARN
  • Remove taskCommand in tensorflow dsl job
  • LIHADOOP-35318 Implement TensorFlow job in hadoop dsl
  • TOOLS-67569 Block Hadoop Client Teams from checking in Hadoop Jars in their MP

The following were contributed by Anant Nag. Thanks, Anant!

  • Minor fix to the test plugin
  • LIHADOOP-28747 Hadoop plugin's test plugin should have functionality to add assertions on tests.
  • LIHADOOP-25519 ligradle tasks to test the flow on Azkaban
  • LIHADOOP-25518 Prototype deploy tests for the workflows
  • LIHADOOP-23255: Add azkaban jvm properties to the jobConf for bangbang
  • LIHADOOP-18923: Integrate bangbang with li-hadoop-plugin
  • LIHADOOP-18499 Add extra typed methods for Hadoop DSL SparkJob properties
  • LIHADOOP-6730: Add clusterProvided configuration for dependency management
  • LIHADOOP-17034 Create HadoopShell job type in Hadoop DSL
  • LIHADOOP-16016 Minor bug fix for li-hadoop-plugin
  • LIHADOOP-15728 Update dependency pattern blacklist rules for li-hadoop-plugin
  • LIHADOOP-15501 checkDependencies task fails if any of groupID, name or version is null
  • LIHADOOP-15077 Hadoop plugin should have a feature to control and monitor dependencies specified by the users
  • Fixups on HadoopJavaJob and HiveJob
  • LIHADOOP-14506 Hadoop plugin should directly submit jobs to oozie
  • LIHADOOP-13682 Oozie Hadoop DSL Compiler support property files
  • LIHADOOP-13681 Oozie Hadoop DSL Compiler support all job types
  • Added xsd schemas for different job types
  • LIHADOOP-14566 SparkPlugin does not read application parameters
  • LIHADOOP-12557 Hadoop Plugin should support launching Spark scripts from a gateway machine
  • LIHADOOP-14099 Create spark job type in hadoop dsl
  • LIHADOOP-13660 Hadoop Plugin oozieUpload task should upload the contents of the hadoopZip task
  • Minor fixes on OozieUploadTask and LiKerberosUtil
  • LIHADOOP-13353 Hadoop Plugin oozieUpload task should upload to versioned directory.
  • LIHADOOP-13457 Fix Stackoverflow exception thrown by OozieUploadTask.
  • LIHADOOP-12771 Prototype Hadoop Plugin Upload Task for Oozie
  • HADOOP-12726 Style cleanups for Hadoop zip tests
  • HADOOP-12243 Fix Hadoop Plugin PCL Breakage
  • HADOOP-10148 Support exclusion list for Hadoop Plugin sources Zip

The following were contributed by Akshay Rai. Thanks, Akshay!

  • LIHADOOP-12945 Hadoop Plugin azkabanUpload enhancement to print project URL
  • HADOOP-12243 Fix Hadoop Plugin PCL Breakage
  • HADOOP-10773 Rewrite li-azkaban2 zip upload tasks for the Hadoop Plugin
  • HADOOP-8251 Hadoop DSL doesn't use -p flag for mkdir

The following were contributed by Keith Dsouza. Thanks, Keith!

  • Added support to the KafkaPushJob to accept the disable.auditing property

The following were contributed by Ragesh Rajagopalan. Thanks, Ragesh!

  • LIHADOOP-20985 azkabanUpload throwing 500 error
  • DI-584 Login prompt missing when uploading to Azkaban

The following were contributed by Pranay Hasan Yerra. Thanks, Pranay!

  • LIHADOOP-32983 Azkaban CLI Tasks should prompt for "password+VIP" internal to Linkedin
  • Addressed a bug in ready status. Cleaned up the codenarc errors in azkaban-client
  • LIHADOOP-33260 Bump to Gradle 4.1
  • LIHADOOP-26107 Parameters to override the setting in .azkabanPlugin.json
  • LIHADOOP-16252 Prevent core-site.xml from polluting IDE's classpath
  • LIHADOOP-25566 Update Dr.Elephant URL link in azkabanFlowStatus task
  • LIHADOOP-25637 Check for trailing slash in AzkabanUrl when running azkabanUpload task
  • LIHADOOP-24274 Checks for empty fields in .azkabanPlugin.json
  • LIHADOOP-21680 Azkaban CLI Features Create Project, Flow Status, Execute Flow and Cancel Flow
  • LIHADOOP-23788 Display custom message after azkabanUpload task
  • LIHADOOP-22850 Handle null console and minor changes in upload bar's display
  • LIHADOOP-22822 The azkabanUpload task should automatically go into edit mode if there are missing required fields
  • LIHADOOP-22809 Display configured Hadoop zips during azkabanUpload task
  • LIHADOOP-22655 Automate creation of Azkaban Project during azkabanUpload task. Also avoids uploading twice on invalid session
  • LIHADOOP-22222 Display Upload Status of Zip while running azkabanUpload task
  • LIHADOOP-21797 Automate the creation of the .azkabanPlugin.json file
  • LIHADOOP-21658 Refactor AzkabanUploadTask in Hadoop Plugin by creating AzkabanHelper class

The following were contributed by Pratik Satish. Thanks, Pratik!

  • LIHADOOP-21429 Add HadoopValidatorPlugin with PigValidatorPlugin in its stack

The following were contributed by Ivan Heda. Thanks, Ivan!

  • Added possibility to have Azkaban password in configuration JSON.

The following were contributed by Alexander Ivaniuk. Thanks, Alexander!

  • Add possibility to group jobs into subflows in Azkaban UI

The following were contributed by Jennifer Dai. Thanks, Jennifer!

  • Add ability for Hadoop DSL to understand the PinotBuildAndPushJob job type

The following were contributed by Jianling Zhong. Thanks, Jianling!

  • Add supports for a "required parameters" check

The following were contributed by Charlie Summers. Thanks, Charlie!

  • DSL Trigger Implementation
  • Upgrade li-hadoop-plugin for Flow 2.0
  • Refactor YamlCompiler to fit closer with AzkabanDslCompiler
  • Allow configurable Yaml creation for Flow 2.0
  • Introduce YamlCompiler, YamlWorkflow, YamlJob, and YamlProject for Flow 2.0
  • Add ability for Hadoop DSL to understand the TableauJob job type
  • Adding additional condition for TableauJob creation
  • Allow generateYamlOutput to be defined multiple times
  • Remove maxWaitMins requirement if no dependencies defined in Trigger
  • Handling emergent flows for Flow 2.0

The following were contributed by Nicholas Cowan. Thanks, Nick!

  • Add ability for Hadoop DSL to understand the HdfsWait job type
  • Add subdirectory hadoop-jobs
  • Adding timezone parameter and ability for directoryPath to build paths based on date for hdfsWaitJobs

The following were contributed by Charles Gao. Thanks, Charles!

  • Add ability for Hadoop DSL to understand the VenicePushJob job type

The following were contributed by Michal Trna. Thanks, Michal!

  • Session IDs are stored separately for each Azkaban server

The following were contributed by Tzu-Han Jan. Thanks, TJ!

  • Updated Hadoop DSL for Kafka-Push-Job to understand multiTopic configurations

The following were contributed by Chris Li.

  • Remove jdbcDriverClass requirement from SQL job type

The following were contributed by Jamie Sun. Thanks, Jamie!

  • Add conditional workflow feature
  • Fix base properties scope for subflows when generating YAML files
  • Fix variable substitution issue when generating YAML files
  • Fix variable substitution issue for dependency list
  • Fix cloneWorkflow and cloneJob issue when generating conditional workflows
  • Fix variable substitution issue for flow dependency list

The following were contributed by Rakesh Malladi. Thanks, Rakesh!

  • Add ability for Hadoop DSL to understand the WormholePushJob job type
  • Deprecate dataset.group property from the WormholePushJob

The following were contributed by Yeni Bermudez. Thanks, Yeni!

  • Add a workaround to handle visiting files multiple times when merging tempprops into .flow files.

The following were contributed by Abhishek Nimesh. Thanks, Abhishek!

  • Add ability for Hadoop DSL to understand the KabootarJob job type

The following were contributed by Zhixiong Chen.

  • Sort config

The following were contributed by Sayali Kadam.

  • Accept property 'ai.project.group' for KabootarJob

The following were contributed by Arjun Singh Bora.

  • Add ability for Hadoop DSL to understand the WormholePushJob2 job type

The following were contributed by Sayali Kadam.

  • Fixing property aiProjectGroup while cloning Kabootar Job

Kyle Moore (DPUkyle) contributed the following:

  • Refactored Hadoop DSL "tests" into proper integration tests using Gradle TestKit
  • Refactored li-hadoop-plugin "tests" into proper integration tests using Gradle TestKit

The following were contributed by Frank Gu.

  • Add ability for Hadoop DSL to understand the KubernetesJob job type