
azkaban-plugins's Introduction

Azkaban Plugins

Build Status

Because this plugin repo is difficult to maintain, the Azkaban team is actively moving plugin code into the main azkaban repo. If you cannot find some code here, check out Azkaban Github.

For all Azkaban Plugins documentation, please go to the Azkaban Project Site.


azkaban-plugins's Issues

Job Summary tweaks

  • Add 20px margin to bottommost box
  • 25% width for Job Type key cell
  • Display placeholder when no stats are available

Cannot clone in Windows env

When I clone the repo, it always throws:

 fatal: cannot create directory at 'plugins/jobtype/jobtypes/hive-0.8.1/hive-0.8.1/aux': Invalid argument

It seems aux is not a valid directory name in a Windows environment?

Thanks

Reportal should let user kill his or her running report

Sometimes after firing off a run, I realize there is a bug in my code. I would like to kill the running report to avoid wasting resources. Currently, there is no way for me to kill the report from the Reportal UI. We should add this functionality.

This is especially critical now that Reportal only allows you to have one RUNNING execution of each report at a time. Sometimes jobs have OOM errors and hang, causing the flow to remain in the RUNNING state forever. In such a scenario, I will never be able to run my report again, since I have no way of killing the currently RUNNING execution.

Alternatively, we should roll back this commit, or make a new commit so that concurrent execution can be enabled/disabled by a property in the Azkaban conf file.

Tracked by internal JIRA HADOOP-6976.

ParquetFileViewer only works on world-readable files

As mentioned in #114, the ParquetFileViewer tries to view every file as azkaban, meaning it can only view files owned by azkaban, with the group set to azkaban, or world-readable. This means users will not be able to view their Parquet files through the HDFSViewer unless their Parquet files are world-readable.

Visualizer tweaks

  • Display placeholder if Pig Visualizer has nothing to visualize.
  • Fix Auto Pan Zoom and Reset Pan Zoom buttons

Job Summary plugin job_id parsing does not work for Hadoop 2 job ids and URLs

In log-data.js, the job_id regexes expect job_<12 digits>_<4+ digits>. However, on Hadoop 2, instead of <12 digits> of the form YYYYMMDDHHMM, the job_id appears to contain the milliseconds since epoch, which is variable in length and is currently 13 digits.

Also, in Hadoop 2, the job URL printed out in the logs does not contain job_ (which the current url regex is looking for) but instead looks something like http://<host>:<port>/proxy/application_<milliseconds_since_epoch>_<counter>.

We should fix the regexes so that the job summary plugin will find the job ids and URLs.

Tracked by internal JIRA HADOOP-6977.
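A sketch of patterns loose enough for both formats, written in Java for illustration (the class and pattern names here are mine; the actual regexes live in log-data.js):

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Illustrative regexes that accept both Hadoop 1 and Hadoop 2 job id forms.
public class JobIdRegexSketch {
    // Hadoop 1 ids embed a 12-digit YYYYMMDDHHMM timestamp; Hadoop 2 uses
    // epoch milliseconds (currently 13 digits), so match any run of digits.
    static final Pattern JOB_ID = Pattern.compile("job_\\d+_\\d+");
    // Hadoop 2 logs print a proxy URL with an application_ prefix instead
    // of job_, so the URL needs its own pattern.
    static final Pattern APP_URL =
        Pattern.compile("https?://\\S+/proxy/application_\\d+_\\d+");

    public static void main(String[] args) {
        Matcher h1 = JOB_ID.matcher("Submitted job: job_201408141030_0042");
        Matcher h2 = APP_URL.matcher(
            "tracking URL: http://rm-host:8088/proxy/application_1408049623456_0007");
        System.out.println(h1.find() ? h1.group() : "no match");
        System.out.println(h2.find() ? h2.group() : "no match");
    }
}
```

The same `\d+` loosening applies directly to the JavaScript regexes, since both engines share this syntax.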

Pig 0.12 job type

It may be useful to create a Pig 0.12.0 job type now that Pig 0.12 is out.

Tracked by internal JIRA ticket: HADOOP-4414

Hive plugin missing Antlr jar

hive-0.8.1/aux/lib should include the ANTLR runtime jar (e.g. antlr-runtime-3.0.1.jar). I used version 3.0.1 and that worked; I don't know what the latest version is.

Hive seems to ignore user property

The Hive plugin works with no user.to.proxy or proxy.user set anywhere. It probably defaults to the user that runs the azkaban executor daemon.

If Pig/hadoopJava did this, Azkaban security would be very easy for small shops (like us) that do not use Hadoop security features.

BinaryJSON HDFS file viewer issues

The BinaryJSON file viewer seems to be unstable. At times, it fails to display a file at all and at other times, it dumps binary junk.

Tracked by internal JIRA: HADOOP-4478

Permission denied when viewing files in user directory

When running Azkaban on a grid with Hadoop security enabled, viewing a file in one's user directory results in a permission denied error.

I have root-caused this to the fact that the Parquet file viewer does not use the FileSystem object passed in from the HdfsBrowserServlet, which is properly set up to doAs the currently logged-in user rather than the azkaban user. As a result, the Parquet file viewer ends up trying to view the file as azkaban, throwing the AccessControlException. Currently, AvroParquetReader does not have an API that lets one pass in a FileSystem object. The fix for now is to remove the catch AccessControlException block from the Parquet file viewer.

Tracked by internal ticket: HADOOP-5350

Cannot find right place for user.to.proxy (also, confusion with proxy.user)

The HadoopSecurityManager_H_1_0 class expects to find a property 'user.to.proxy'. I have placed this in every configuration file and .job file, and nothing has worked.

Which file should this property be in?

Here is the full log section for this attempt at running the java-wc job. HadoopSecurityManager_H_1_0 is clearly trying to pass in a 'user.to.proxy' property which is not there.

2013/08/02 03:08:05.255 +0000 INFO [pig-upload] [Azkaban] Need to proxy. Getting tokens.
2013/08/02 03:08:05.255 +0000 INFO [pig-upload] [Azkaban] Getting hadoop tokens for apxqueue
2013/08/02 03:08:05.255 +0000 INFO [HadoopSecurityManager] [Azkaban] proxy user apxqueue not exist. Creating new proxy user
2013/08/02 03:08:05.258 +0000 INFO [pig-upload] [Azkaban] Getting DFS token from 10.176.235.204:8020hdfs://ip-10-176-235-204.us-west-1.compute.internal:8020
java.lang.NullPointerException
at org.apache.hadoop.security.SecurityUtil.setTokenService(SecurityUtil.java:246)
at org.apache.hadoop.hdfs.DFSClient.getDelegationToken(DFSClient.java:408)
at org.apache.hadoop.hdfs.DistributedFileSystem.getDelegationToken(DistributedFileSystem.java:571)
at azkaban.security.HadoopSecurityManager_H_1_0$2.getToken(HadoopSecurityManager_H_1_0.java:268)
at azkaban.security.HadoopSecurityManager_H_1_0$2.run(HadoopSecurityManager_H_1_0.java:258)
at azkaban.security.HadoopSecurityManager_H_1_0$2.run(HadoopSecurityManager_H_1_0.java:255)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:396)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1122)
at azkaban.security.HadoopSecurityManager_H_1_0.prefetchToken(HadoopSecurityManager_H_1_0.java:253)
at azkaban.jobtype.HadoopPigJob.getHadoopTokens(HadoopPigJob.java:166)
at azkaban.jobtype.HadoopPigJob.run(HadoopPigJob.java:102)
at azkaban.execapp.JobRunner.runJob(JobRunner.java:379)
at azkaban.execapp.JobRunner.run(JobRunner.java:280)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
at java.util.concurrent.FutureTask.run(FutureTask.java:138)
at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:662)
azkaban.security.commons.HadoopSecurityManagerException: Failed to get hadoop tokens! nullnull
at azkaban.security.HadoopSecurityManager_H_1_0.prefetchToken(HadoopSecurityManager_H_1_0.java:318)
at azkaban.jobtype.HadoopPigJob.getHadoopTokens(HadoopPigJob.java:166)
at azkaban.jobtype.HadoopPigJob.run(HadoopPigJob.java:102)
at azkaban.execapp.JobRunner.runJob(JobRunner.java:379)
at azkaban.execapp.JobRunner.run(JobRunner.java:280)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
at java.util.concurrent.FutureTask.run(FutureTask.java:138)
at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:662)
2013/08/02 03:08:05.261 +0000 ERROR [pig-upload] [Azkaban] Job run failed!
2013/08/02 03:08:05.261 +0000 ERROR [pig-upload] [Azkaban] Failed to get hadoop tokens! nullnullnull
2013/08/02 03:08:05.261 +0000 INFO [pig-upload] [Azkaban] Finishing job pig-upload at 1375412885261
2013/08/02 03:08:05.267 +0000 INFO [wordcount-java] [Azkaban] Job Finished pig-upload with status FAILED
2013/08/02 03:08:05.278 +0000 INFO [wordcount-java] [Azkaban] Killing wordcount-java due to prior errors.
2013/08/02 03:08:05.289 +0000 INFO [wordcount-java] [Azkaban] Finishing up flow. Awaiting Termination
2013/08/02 03:08:05.289 +0000 INFO [wordcount-java] [Azkaban] Setting flow status to Failed.
2013/08/02 03:08:05.289 +0000 INFO [wordcount-java] [Azkaban] Flow is set to FAILED
2013/08/02 03:08:05.289 +0000 INFO [wordcount-java] [Azkaban] Setting end time for flow 8 to 1375412885289
2013/08/02 03:08:05.305 +0000 INFO [FlowRunnerManager] [Azkaban] Flow 8 is finished. Adding it to recently finished flows list.
2013/08/02 03:10:01.532 +0000 INFO [FlowRunnerManager] [Azkaban] Cleaning recently finished
2013/08/02 03:10:01.532 +0000 INFO [FlowRunnerManager] [Azkaban] Cleaning execution 8 from recently finished flows list.
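For reference, user.to.proxy is normally set as a job-level property, i.e. in the .job file itself; a hedged sketch (the job class name and value are placeholders):

```properties
# java-wc.job (sketch; class name and proxy user are placeholders)
type=hadoopJava
job.class=com.example.WordCount
user.to.proxy=apxqueue
```

Note that the log above shows proxying for apxqueue did start ("Getting hadoop tokens for apxqueue"), so the property appears to have been picked up; the NullPointerException occurs later, inside SecurityUtil.setTokenService while fetching the DFS token, and may be a separate configuration issue.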

CamusJob cannot be launched using hadoopJavaJob and Azkaban2

I am filing this issue because I encountered the same issue described at https://groups.google.com/d/msg/azkaban-dev/S9G9Lqmfm1Q/7pV0P7Re820J but there does not seem to be a bug report for it yet.

The problem is that the CamusJob run(String[] args) method signature is not supported by the hadoopJavaJob plugin. This results in the following error:

14-08-2014 14:32:44 PDT consume_kafka ERROR - Caused by: java.lang.IllegalArgumentException: Can not create a Path from a null string
14-08-2014 14:32:44 PDT consume_kafka ERROR -   at org.apache.hadoop.fs.Path.checkPathArg(Path.java:87)
14-08-2014 14:32:44 PDT consume_kafka ERROR -   at org.apache.hadoop.fs.Path.<init>(Path.java:99)
14-08-2014 14:32:44 PDT consume_kafka ERROR -   at com.linkedin.camus.etl.kafka.mapred.EtlMultiOutputFormat.getDestinationPath(EtlMultiOutputFormat.java:113)
14-08-2014 14:32:44 PDT consume_kafka ERROR -   at com.linkedin.camus.etl.kafka.CamusJob.run(CamusJob.java:181)

run() is invoked by default, but it never receives the main.args needed to load the Camus-specific properties, so key settings are missing when it runs. The proper behaviour seems to be to call Camus's run(String[] args) method so that Camus can initialize properly.

I am happy to give this a shot if I can get some pointers on how / what to adjust in the hadoopJavaJob plugin.
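The delegation described above can be sketched as follows. This is a self-contained stand-in, not Camus code: the class name and method bodies are illustrative, and the no-arg run() simply forwards the configured args to the run(String[]) overload.

```java
// Hypothetical sketch of the fix: the plugin invokes a no-arg run(),
// which delegates to the run(String[]) signature Camus implements.
public class CamusJobSketch {
    private final String[] mainArgs;

    public CamusJobSketch(String[] mainArgs) {
        this.mainArgs = mainArgs;
    }

    // The entry point hadoopJavaJob invokes today: no arguments.
    public int run() {
        // Forward the configured main.args so the properties path is
        // no longer null when the job initializes.
        return run(mainArgs);
    }

    // Simplified stand-in for the signature Camus actually implements.
    public int run(String[] args) {
        if (args == null || args.length == 0 || args[0] == null) {
            // Mirrors the observed failure mode: a null path argument.
            throw new IllegalArgumentException("Can not create a Path from a null string");
        }
        return 0;
    }

    public static void main(String[] args) {
        System.out.println(new CamusJobSketch(new String[] {"camus.properties"}).run());
    }
}
```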

Ensure Reportal reports can't run concurrently

This is to prevent scheduled repeating reports from hogging the Azkaban server. In one case we saw, there were many running instances of the same report because it took a while to complete and was configured to repeat every minute.
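A minimal sketch of such a guard (class and method names are hypothetical; the real check would live in Reportal's scheduling path rather than a standalone class):

```java
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

// Per-report "single running execution" guard: a report id may only be
// registered once until its current execution finishes.
public class ConcurrencyGuard {
    private final Set<String> running = ConcurrentHashMap.newKeySet();

    // Returns true if the report may start; false if an execution is live.
    public boolean tryStart(String reportId) {
        return running.add(reportId);
    }

    // Must be called when the execution ends, so the report can run again.
    public void finish(String reportId) {
        running.remove(reportId);
    }

    public static void main(String[] args) {
        ConcurrencyGuard guard = new ConcurrencyGuard();
        System.out.println(guard.tryStart("daily-report")); // first start allowed
        System.out.println(guard.tryStart("daily-report")); // second rejected
        guard.finish("daily-report");
        System.out.println(guard.tryStart("daily-report")); // allowed again
    }
}
```

Note that a guard like this interacts with the kill issue above: if finish() is never reached (e.g. a hung RUNNING execution), the report stays blocked forever.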

Import Azkaban 2.6.1 JARs

Due to azkaban/azkaban#255, the package names of some classes have been changed. For example, azkaban.webapp.AzkabanServer is now azkaban.server.WebServer.

The new Azkaban 2.6.1 JARs need to be imported so that the plugins can compile against the JARs with the new packages to be compatible with the new Azkaban core.
