Improved Dockerfile.gocd-agent #13

erithmetic · 2015-09-06T16:41:13Z

There is no need to use init scripts to start the agent service within a docker
container. This new image just runs the go agent directly. This solves a problem where the container didn't respond properly to signals like SIGINT/SIGTERM.

This image is based off of the official Java 7 headless JRE, which helps limit
image size. It's now 339MB vs 433MB.

This PR can also help fix #11

There is no need to use init scripts to start the agent service within a docker container. The Phusion team didn't really "get" docker when they built their images and published their recommendations a year ago. This solves a problem where the container didn't respond properly to signals like SIGINT/SIGTERM. This image is based off of the official Java 7 headless JRE, which helps limit image size. It's now 339MB vs 433MB. This PR can also help fix gocd#11

erithmetic · 2015-09-06T16:41:43Z

Consequently these kinds of fixes can be applied to the other images. I just wanted to run this PR by you all.

kmugrage · 2015-09-06T17:00:03Z

It appears that moving the host into the Dockerfile has made this not work when used with Compose.

This used to work...

server:
  image: gocd/gocd-server
  ports:
    - "8153:8153"
agent:
  image: gocd-agent
  links:
    - server:go-server

After applying your changes the agent wants to connect to localhost:8153 (or whatever I put in the Dockerfile of course) and not what's provided by Compose.

Any suggestions?

erithmetic · 2015-09-06T17:24:17Z

Ah yes, the GO_SERVER var should default to go-server. I'll fix

kmugrage · 2015-09-06T18:14:06Z

That does make mine work, but I'm still not sure hard coding it is the "right thing".

Interested in other opinions (that we're not likely to get on a holiday weekend)

erithmetic · 2015-09-06T21:57:36Z

The ENV directive sets the default value for the environment variable and it can be overridden at runtime with "docker run -e GO_SERVER=foo". Since the docs state that that default will be used I think this is a good solution.

erithmetic · 2015-09-11T13:47:58Z

Any other thoughts? Is this good to merge?

kmugrage · 2015-09-11T14:27:55Z

@arvindsv maintains this the most and he's out of the office this week. He'll be back early next week. Sorry for the delay!

dmg3 · 2015-10-16T16:30:18Z

What's the best way to add to the agent's PATH environment? I haven't been able to follow the logic from the /sbin/my_init to determine how I can change it

FredrikWendt · 2015-10-17T06:41:26Z

@dkastner I take it you disagree with http://phusion.github.io/baseimage-docker/ on why you should use an init system in your Docker container? Would you care to explain why?
Would also care to explain why your builds, run by gocd-agent, should require root privileges?

erithmetic · 2015-10-18T14:04:36Z

@dmg3 if you really need to override PATH, you can do it at runtime with docker run -e PATH=/usr/bin:/my/new/path or you can modify it in a custom Dockerfile/image build:

FROM gocd/gocd-agent

ENV PATH=/usr/bin:/my/new/path

erithmetic · 2015-10-18T15:00:16Z

@FredrikWendt

re: Phusion's philosophy:

Phusion came up with its baseimage while docker was still in its early stages and they were operating under the sort of assumptions you make when you have a "snowflake" server that needs to have maximal uptime. I'll address each of their reasons for their baseimage:

chron

We don't need chron. If we did, it'd be better to build a specialized image that installs chron and is only in charge of running chron, not your app and a bunch of other processes in addition to chron.

syslog

AFAIK We don't need it.

ssh

Needing to ssh into a container is an antipattern. The recommended practice is to not auto-rm containers when they die, allowing you to inspect them with docker exec and docker cp to investigate crashes.

runit/daemon monitoring

This isn't needed with docker's --restart flag and/or platforms like ECS or Kubernetes.

zombie processes

This shouldn't be a problem if the agent properly waits for the command processes it executes.

re: root privileges.

It's not that anything should require root privileges, it's just that with docker everything can run as root because it's all secured within LXC's user namespace. In other words, the root user here is "jailed" and unable to affect the host OS. If there is a security vulnerability in the agent itself, the container will be compromised, but the effects will be limited.

For any of these concerns, we don't need to make gocd-agent responsible. Users can build custom images using this barebones image which is much smaller than the phusion baseimage.

dmg3 · 2015-10-19T13:17:31Z

@dkastner Thanks! That's the piece I was missing!

On Sun, Oct 18, 2015 at 9:04 AM, Derek Kastner [email protected]
wrote:

@dmg3 https://github.com/dmg3 if you really need to override PATH, you
can do it at runtime with docker run -e PATH=/usr/bin:/my/new/path or you
can modify it in a custom Dockerfile/image build:

FROM gocd/gocd-agent
ENV PATH=/usr/bin:/my/new/path

—
Reply to this email directly or view it on GitHub
#13 (comment).

labianchin · 2015-11-04T20:43:04Z

I think @dkastner approach is very good. I also believe other images (server, dev, build-installer) should also favor using debian or java official image, instead of phusion/baseimage.

Please also see jpetazzo's article on why thin images are better: https://jpetazzo.github.io/2014/06/23/docker-ssh-considered-evil/

manojlds · 2015-11-06T09:33:23Z

This change is very useful, using it for my purposes.

labianchin · 2015-11-09T14:13:06Z

I just notice one problem with this image: zombie process and reaping, more here:
http://blog.phusion.nl/2015/01/20/docker-and-the-pid-1-zombie-reaping-problem/

I noticed that folks from Jenkins, solved that using a tool called tini, the discussion is here: jenkinsci/docker#54 (comment)

I am working on solving this, still WIP here https://github.com/labianchin/gocd-docker/blob/no-daemon2/gocd-agent/Dockerfile. When it is done, I will create a PR.

Other thing that would be nice is to check the SHA1 sum from the downloaded deb package.

erithmetic · 2015-11-09T15:27:26Z

As far as I understand, the only reason a process could become a zombie is if:

Someone manually sends a kill -9 to a process that Go is running that happens to have subprocesses
Go sends a SIGKILL to the task it is running
The OOM killer times out when trying to SIGTERM a process and sends it SIGKILL

Are there any other instances?

The first source of zombies seems like a "user error" problem - in that if you go around kill -9ing things, I'm inclined to say you have no business expecting the system to clean up after you.

I believe the second case could be solved by Go itself. If Go needs to kill subprocesses, it should be assigning subprocesses to process groups and sending kill signals to the entire group.

The tini solution seems to be a good short-term fix, but the PR_SET_CHILD_SUBREAPER setting it supports could theoretically be handled by Go itself (and would be a great addition on any platform), although I'm not familiar with java enough to know if it can support those kernel features.

gkid · 2016-01-05T16:33:42Z

Great idea - This is almost exactly like the my version. Which I created for the exact same reasons @dkastner mentions. I just used the gocd zip file download instead of the installer to have all the files conveniently in one place and more control over other stuff like the java version. I also dockerized the server in the same manner.

I'd suggest adding agent.auto.register.hostname ($AGENT_HOSTNAME) to the autoregister.properties file (available since v15.2) in go-agent-start.sh in the same manner as $AGENT_RESOURCES and $AGENT_ENVIRONMENTS. In my version I actauly add $AGENT_HOSTNAME-$(hostname) so I also have the real hostname of the (docker)machine so it can easily be traced.

You might also wish to make some compatibility with running from zip install. The only two things that need to be considered (in go-agent-start.sh) are:
a. $AGENT_CONFIG_DIR (in my version if it doesn't exist then I set it to the current working dir/config (and create the directory).
b. location of (i execute from current dir - exec ./agent.sh)

btw - prior to version 15.3.0, the agent.sh script starts the agent with eval "$CMD" (not exec) - so it does not become the root process nor receive signals.

arvindsv · 2016-03-10T22:11:31Z

I'm trying to incorporate some of the changes in this PR into #27. Feedback welcome.

labianchin · 2016-03-10T22:14:30Z

Hi @arvindsv. Also take a look on @tanob images tanob/gocd-docker. He is using it for a couple of months already in his project.

arvindsv · 2016-03-10T22:17:11Z

Thanks, @labianchin. I will.

zabil · 2016-08-03T05:25:16Z

Closing in favour of #43

Default GO_SERVER to "go-server"

1b9ec3b

Simply run the agent as root

50ef339

arvindsv mentioned this pull request Mar 10, 2016

RFC: Move away from phusion/baseimage #27

Closed

zabil closed this Aug 3, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improved Dockerfile.gocd-agent #13

Improved Dockerfile.gocd-agent #13

erithmetic commented Sep 6, 2015

erithmetic commented Sep 6, 2015

kmugrage commented Sep 6, 2015

erithmetic commented Sep 6, 2015

kmugrage commented Sep 6, 2015

erithmetic commented Sep 6, 2015

erithmetic commented Sep 11, 2015

kmugrage commented Sep 11, 2015

dmg3 commented Oct 16, 2015

FredrikWendt commented Oct 17, 2015

erithmetic commented Oct 18, 2015

erithmetic commented Oct 18, 2015

dmg3 commented Oct 19, 2015

labianchin commented Nov 4, 2015

manojlds commented Nov 6, 2015

labianchin commented Nov 9, 2015

erithmetic commented Nov 9, 2015

gkid commented Jan 5, 2016

arvindsv commented Mar 10, 2016

labianchin commented Mar 10, 2016

arvindsv commented Mar 10, 2016

zabil commented Aug 3, 2016

Improved Dockerfile.gocd-agent #13

Improved Dockerfile.gocd-agent #13

Conversation

erithmetic commented Sep 6, 2015

erithmetic commented Sep 6, 2015

kmugrage commented Sep 6, 2015

erithmetic commented Sep 6, 2015

kmugrage commented Sep 6, 2015

erithmetic commented Sep 6, 2015

erithmetic commented Sep 11, 2015

kmugrage commented Sep 11, 2015

dmg3 commented Oct 16, 2015

FredrikWendt commented Oct 17, 2015

erithmetic commented Oct 18, 2015

erithmetic commented Oct 18, 2015

re: Phusion's philosophy:

chron

syslog

ssh

runit/daemon monitoring

zombie processes

re: root privileges.

dmg3 commented Oct 19, 2015

labianchin commented Nov 4, 2015

manojlds commented Nov 6, 2015

labianchin commented Nov 9, 2015

erithmetic commented Nov 9, 2015

gkid commented Jan 5, 2016

arvindsv commented Mar 10, 2016

labianchin commented Mar 10, 2016

arvindsv commented Mar 10, 2016

zabil commented Aug 3, 2016