Terraform-aws-github-runner: Automate the creation of an offline runner

Created on 6 Feb 2021 · 11Comments · Source: philips-labs/terraform-aws-github-runner

The current approach requires we alwsys have an offline runner regirstered. Those offline runners are removed by github eacht 60 days. Therefore it would be convient to automate the process to keep 1 offline runner so we can scale back to 0.

Potential solution direction

Register via ED2 instance

Use the same mechanism we use to spin up runners with an extra lambda that only execute the user data till the config step. And also ensure the ec2 instance is take down

Reverse egineer github config process

Create lamda that is using github http calls based on reverse engineering, see https://github.com/actions/runner/issues/558

Run the config in a lambda

Create a lambda that can exectue the config via a lambda layer

enhancement help wanted

Source

npalm

👀5

Most helpful comment

@npalm I've figured out most of the logic, urls, etc involved with registering a runner. I plan on making a python module to handle this using python 3x and requests. This will probably be this weekend when I'm off work.

If you would prefer to implement it yourself, I can share my notes I've gathered about the process before then.

I honestly think I want to spend some more time tracing the entire process, so it could actually pull and perform workflows as well, but the new runner registration was a must have for me.

Basically the process I have found to work:
Get runner token via api using pat or app creds
Post token to secretish endpoint, receive back json data with new secretish endpoint and a jwt
Use jwt to auth via auth bearer header, using newly revealed endpoint, can query existing "agents"(as they are called in the api), add a new one, or update an existing one.

There's obviously some more details involved, including creating an RSA key, and a bunch of headers. I haven't looked further than registration yet...

miked63017 on 9 Feb 2021

👍3 🎉1

All 11 comments

If you would prefer to implement it yourself, I can share my notes I've gathered about the process before then.

I honestly think I want to spend some more time tracing the entire process, so it could actually pull and perform workflows as well, but the new runner registration was a must have for me.

There's obviously some more details involved, including creating an RSA key, and a bunch of headers. I haven't looked further than registration yet...

miked63017 on 9 Feb 2021

👍3 🎉1

@gertjanmaas I think you will like the comment above

npalm on 9 Feb 2021

👍1

@miked63017 let me know if you don't get to it. We are looking for this as well so I would have some time to work on it.

Edit: Not sure if you saw, but this is newly released for Python: https://github.blog/2020-12-18-learn-about-ghapi-a-new-third-party-python-client-for-the-github-api/

mcaulifn on 10 Feb 2021

👍1

@npalm @mcaulifn here is a link, it's still pretty beta'ish and not well documented, but I guess we can say the same thing about the runners/actions API in general :-)

https://github.com/miked63017/pyghrunner

miked63017 on 14 Feb 2021

@mcaulifn in RE to the ghapi module it looks cool, but most of these calls are undocumented pieces of the api, and probably subject to change.

miked63017 on 14 Feb 2021

Overall looks like it should work. Are you planning on adding it to this repo?

mcaulifn on 2 Mar 2021

@gertjanmaas any opinion?

npalm on 2 Mar 2021

@npalm @mcaulifn not sure if I have the context to add it here to this repo, I am personally working in a GKE operator to do similar but figured it could help others to share some simple code to integrate with other projects. Seems to be a fairly common request for this functionality. If you'd like me to take a crack at adding here via a PR I can maybe spend some time this weekend.

miked63017 on 3 Mar 2021

👍2

I quickly skimmed through the python code and it seems to confirm what I saw when I tried to reverse engineer it a while ago. Would be great if this could be implemented here. Getting tired of adding offline runners by hand :P

gertjanmaas on 3 Mar 2021

@gertjanmaas where I am running (the equivalent) of this code(in a private library), we just run the few methods periodically, or in response to an event, and overwrite the previous "virtual runner". We are basically just using it as a placeholder so jobs queue rather than fail because no runners with labels exist. Then we look at the jobs details, and spin up the appropriate runner, with appropriate labels as needed, and with the --once flag.

I still have plans of further investigation into creating a full custom runner, most likely written in python, that can then be embedded in other places. This just hasn't been high priority for me yet.

miked63017 on 4 Mar 2021

How about runner deregistration?

The offline runner basically needs to be recreated every 30 days, in order to never have 0 runners in the org.

This should be automated as well.