Switch job scheduling over to pungi4 + fedmsg
ClosedPublic
Actions

Authored by adamwill on Feb 24 2016, 1:32 AM.

Details

Reviewers

garretraziel
jskladan

Commits

rOPENQA9618771443cb: Pungi 4 conversion: no more fedfind
rOPENQAc809f31f22a5: listener: twiddle hashbang a bit
rOPENQA748b5b544ba0: actually, no, let's just hardcode the atomic stuff
rOPENQA6c662ef12a16: listener: allow ctrl-c
rOPENQA0a7128a58c0e: add the little stub script to run the consumer from git
rOPENQA5147ae348159: pungi4: drop the systemd units
rOPENQA3b50fd32ef8d: pungi4: go with Atomic as the variant name
rOPENQAa626a6151d23: move fedmsg consumer into the package, install with setuptools
rOPENQA0636117e5b42: add a simple fedmsg listening job scheduler (John Dulaney)
rOPENQA63e872af09bb: add a systemd service for the fedmsg consumer
rOPENQA54bd4fd1a0f2: pungi4 images: fudge up 'payload', separate match dict
rOPENQA600fb26bf3e1: pungi4: drop the Rawhide 'VERSION' hack, dgilmore fixed it
rOPENQA836525d4b953: make get_images 'private', add _get_compose_id
rOPENQA32dbefbec12f: pungi4: use fedfind (sigh) to cope with some...details
rOPENQA58775ee89db2: fedmsg consumer: handle old-style two-week atomic fedmsgs too

Summary

This is a big diff with all the changes from the 'pungi4'
branch - that branch has the changes split into multiple commits.

This pretty much entirely rewrites scheduling so instead of using
fedfind to find composes and wait for them to exist and find images
in them, and having systemd timers with implied knowledge about when
various types of compose show up, we listen out for fedmsg messages
to tell when a new compose has appeared, and we use the Pungi 4
metadata to decide what images we want to download and test.

Ideally we'd like to have the fedmsg listening bits be a Taskotron
trigger and task, but that's still going through review at present
so we're just going to use a simple standalone consumer for now.

One annoying issue is that we still want to test the daily 'two week
Atomic' test composes, and those will not be done with Pungi 4 yet.
So we have a couple of small hacks in the fedmsg consumer and some
more dumb hacks in the scheduler to cope with those: we listen out
for the fedmsg's from that compose process as well as Pungi 4
fedmsg's, and we just hard code the expected location and metadata
of the single ISO we actually want to test within such a compose. At
first I wrote a whole 'clever' layer in fedfind to synthesize Pungi
4-y metadata for a non-Pungi-4 compose, but it was way too much code
for the job we really need to do, this is much simpler.

Test Plan

Install it (you'll need to clean up the old systemd units
manually, unfortunately, setuptools is pretty dumb), start the new
consumer service, wait for a compose to happen and see if you get
some jobs. fedmsg-dg-replay may help test the fedmsg consumer-y bits,
and to test the rest of it you can use the CLI (it's been rejigged
a lot and now simply accepts a compose location), or just hook right
into jobs_from_compose.

There will be a companion diff for openqa_fedora that updates the
flavor names (and fixes a few other things up to work in a world
where we use the Pungi 4 'compose ID' as the openQA 'build').

Diff Detail

Repository

rOPENQA fedora_openqa

Lint

Automatic diff as part of commit; lint not applicable.

Unit

Automatic diff as part of commit; unit tests not applicable.

adamwill retitled this revision from to Switch job scheduling over to pungi4 + fedmsg.Feb 24 2016, 1:32 AM

adamwill updated this object.

adamwill edited the test plan for this revision. (Show Details)

adamwill added reviewers: jskladan, garretraziel.

Herald added a subscriber: tflink. · View Herald TranscriptFeb 24 2016, 1:32 AM

So here's why I sent this now:

<adamw> dgilmore: so what's your current thinking wrt pungi4 switchover?
<dgilmore> adamw: branched is not enabled
 adamw: and rawhide is disabled
<dgilmore> I am going to change it
 and will have to work as we go on the missing things
<dgilmore> adamw: I may run it manually tomorrow
<dgilmore> I will need to get nirik to have someone reinstall branched and rawhide composer boxes
<adamw> when you say 'rawhide is disabled', you mean at this point we are getting no more old-style rawhide nightlies?
<dgilmore> adamw: correct
<adamw> ok
 and we will never get any old-style 24 branched nightlies?
 (unless we decide all this is awful and we have to change our plans)
<dgilmore> correct
<adamw> okay.

So we should have a pretty low bar for merging this, since the current code will basically never manage to test anything but two-week atomic composes ever again (even the current bit is very unlikely to work any more when we get around to figuring out how TCs and RCs are going to work, based on my poke through the Pungi 4 code today, they're gonna look different).

adamwill mentioned this in D754: Pungi 4 conversion: handle Pungi-derived BUILD and FLAVOR.Feb 24 2016, 2:03 AM

this using ISOURL is correct for now as the openqa update which would make us have to use ISO_URL is still in updates-testing ATM, though I'll push it stable ASAP and we'll have to update that.

Other than my comments, lgtm.

scheduler/setup.py
18	We should bump version also :-).
32	I think that fedfind is still needed for `fedfind.helpers`.

This revision is now accepted and ready to land.Feb 24 2016, 8:57 AM

ACK, code looks good. Get rid of the stupid copyright/license clause, or replace it by the short version - having a header longer than the code is silly...

scheduler/fedora_openqa_schedule/consumer.py
2–21 ↗	(On Diff #1935)	This is really unnecessary, and we don't do it anywhere in the rest of the code. If @jdulaney really needs to have "copyright" on a piece of code, that is basically just a copy of fedmsg consumer example script, then be it, but do it this way: # Copyright 2016 John Dulaney # License: GPL-2.0+ <http://spdx.org/licenses/GPL-2.0+> # Authors: John Dulaney ... Adam Williamson ...

Also please update config sample.

Not sure whether it's true in production, but I tried to use fedmsg-dg-replay to replay this message and consume() function doesn't receive only "msg" part, but it receives whole message like this:

{u'username': u'jsedlak',
 u'i': 1,
 u'timestamp': 1456312058,
 u'msg_id': u'2016-1db3386a-5b35-4f69-8a1d-1d23d0bcc1ae',
 u'topic': u'org.fedoraproject.dev.pungi.compose.status.change',
 u'msg': {u'status': u'FINISHED_INCOMPLETE',
   u'location': u'http://kojipkgs.fedoraproject.org/compose//rawhide/Fedora-Rawhide-20160222.n.0/compose',
   u'compose_id': u'Fedora-Rawhide-20160222.n.0'}
}

Please verify it.

This revision now requires changes to proceed.Feb 24 2016, 11:11 AM

Also, when I use msg = msg['msg'] at the beginning of consume function, it schedules jobs, but then it shows:

logger.info("Jobs run on %s: %s", compose, ' '.join(jobs))
TypeError: sequence item 0: expected string, int found

problem is that jobs is list of ints, but join only works on list of strings.

thanks for testing! you may be able to tell I didn't ;) (well I tested the bits below fedmsg, but not fedmsg, it was too late). I will clean up the problems and test before merge, and check with threebean the format the message actually arrives in.

scheduler/fedora_openqa_schedule/consumer.py
2–21 ↗	(On Diff #1935)	"This is really unnecessary, and we don't do it anywhere in the rest of the code." Actually we do, the rest of the scheduler uses it too. I took the format from some 'best practices' thing somewhere. Since the consumer is part of the scheduler package now I think it makes sense to keep the format consistent... if we decide to change it, let's change the whole scheduler as one separate commit.

Closed by commit rOPENQA63e872af09bb: add a systemd service for the fedmsg consumer (authored by adamwill). · Explain WhyFeb 24 2016, 5:35 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

			Path	Packages
M			scheduler/setup.py (3 lines)
A	M		scheduler/systemd/openqa-consumer.service (6 lines)

Diff	ID	Base	Description	Created	Lint	Unit
Base			Base
Diff 1	1935	b74e3f0		Feb 24 2016, 1:32 AM	★	★
Diff 2	1941	0a7128a	rOPENQA63e872af09bb03d91fbc2528e5c52351e1bff1f7	Feb 24 2016, 12:53 AM	★	★

Commit	Tree	Parents	Author	Summary	Date
63e872af09bb	86a581ffd2e2	0a7128a58c0e	Adam Williamson	add a systemd service for the fedmsg consumer	Feb 24 2016, 12:53 AM
0a7128a58c0e	152f9f029b36	a626a6151d23	Adam Williamson	add the little stub script to run the consumer from git	Feb 24 2016, 12:23 AM
a626a6151d23	5aeff9e2434d	c809f31f22a5	Adam Williamson	move fedmsg consumer into the package, install with setuptools (Show More…)	Feb 24 2016, 12:20 AM
c809f31f22a5	ccfd88c75e3f	6c662ef12a16	Adam Williamson	listener: twiddle hashbang a bit	Feb 24 2016, 12:16 AM
6c662ef12a16	3da8d1209184	58775ee89db2	Adam Williamson	listener: allow ctrl-c	Feb 24 2016, 12:16 AM
58775ee89db2	6b9c0e94a4fb	0636117e5b42	Adam Williamson	fedmsg consumer: handle old-style two-week atomic fedmsgs too (Show More…)	Feb 24 2016, 12:13 AM
0636117e5b42	0596d27caa3e	3b50fd32ef8d	Adam Williamson	add a simple fedmsg listening job scheduler (John Dulaney) (Show More…)	Feb 23 2016, 8:40 PM
3b50fd32ef8d	6818b08ac13a	748b5b544ba0	Adam Williamson	pungi4: go with Atomic as the variant name (Show More…)	Feb 23 2016, 8:34 PM
748b5b544ba0	180851b168cd	32dbefbec12f	Adam Williamson	actually, no, let's just hardcode the atomic stuff (Show More…)	Feb 23 2016, 6:57 PM
32dbefbec12f	b45afe722a9a	600fb26bf3e1	Adam Williamson	pungi4: use fedfind (sigh) to cope with some...details (Show More…)	Feb 23 2016, 8:55 AM
600fb26bf3e1	45f4792597f7	54bd4fd1a0f2	Adam Williamson	pungi4: drop the Rawhide 'VERSION' hack, dgilmore fixed it (Show More…)	Feb 20 2016, 12:14 AM
54bd4fd1a0f2	e55482ab6d31	5147ae348159	Adam Williamson	pungi4 images: fudge up 'payload', separate match dict (Show More…)	Feb 19 2016, 11:59 PM
5147ae348159	0716c7029fb7	836525d4b953	Adam Williamson	pungi4: drop the systemd units (Show More…)	Feb 19 2016, 7:40 PM
836525d4b953	42ad9dacb31e	9618771443cb	Adam Williamson	make get_images 'private', add _get_compose_id (Show More…)	Feb 2 2016, 2:24 PM
9618771443cb	b52f253d6df5	b74e3f0aae55	Adam Williamson	Pungi 4 conversion: no more fedfind (Show More…)	Jan 28 2016, 5:01 AM

Diff 1941

View Options

scheduler/setup.py

Show All 9 Lines
10	def read(fname):	10		def read(fname):
11	return open(os.path.join(os.path.dirname(__file__), "..", fname)).read()	11		return open(os.path.join(os.path.dirname(__file__), "..", fname)).read()
12		12
13	# Allow modification of systemd install location via env var.	13		# Allow modification of systemd install location via env var.
14	SYSTEMDUNITPATH = os.getenv("SYSTEMDUNITPATH", '/usr/lib/systemd/system')	14		SYSTEMDUNITPATH = os.getenv("SYSTEMDUNITPATH", '/usr/lib/systemd/system')
15		15
16	setup(	16		setup(
17	name = "fedora-openqa",	17		name = "fedora-openqa",
18	version = "1.0",	18		version = "1.0",
			garretrazielUnsubmitted Not Done We should bump version also :-).
19	entry_points = {	19		entry_points = {
20	'console_scripts': [	20		'console_scripts': [
21	'fedora-openqa-schedule = fedora_openqa_schedule.cli:main',	21		'fedora-openqa-schedule = fedora_openqa_schedule.cli:main',
22	'fedora-openqa-consumer = fedora_openqa_schedule.consumer:main',	22		'fedora-openqa-consumer = fedora_openqa_schedule.consumer:main',
23	],	23		],
24	},	24		},
25	author = "Fedora QA devel team",	25		author = "Fedora QA devel team",
26	author_email = "qa-devel@lists.fedoraproject.org",	26		author_email = "qa-devel@lists.fedoraproject.org",
27	description = "Fedora openQA scheduler",	27		description = "Fedora openQA scheduler",
28	license = "GPLv3+",	28		license = "GPLv3+",
29	keywords = "fedora openqa test qa",	29		keywords = "fedora openqa test qa",
30	url = "https://bitbucket.org/rajcze/openqa_fedora_tools",	30		url = "https://bitbucket.org/rajcze/openqa_fedora_tools",
31	packages = ["fedora_openqa_schedule"],	31		packages = ["fedora_openqa_schedule"],
32	install_requires = ['openqa-client', 'setuptools', 'six'],	32		install_requires = ['openqa-client', 'setuptools', 'six'],
			garretrazielUnsubmitted Not Done I think that fedfind is still needed for `fedfind.helpers`.
33	long_description=read('README.md'),	33		long_description=read('README.md'),
34	classifiers=[	34		classifiers=[
35	"Development Status :: 3 - Alpha",	35		"Development Status :: 3 - Alpha",
36	"Topic :: Utilities",	36		"Topic :: Utilities",
37	"License :: OSI Approved :: GNU General Public License v3 or later "	37		"License :: OSI Approved :: GNU General Public License v3 or later "
38	"(GPLv3+)",	38		"(GPLv3+)",
39	],	39		],
		40		data_files=[
		41		(SYSTEMDUNITPATH, glob.glob('systemd/*.service')),
		42		],
40	)	43		)