summaryrefslogtreecommitdiffstats
Commit message (Collapse)AuthorAgeFilesLines
...
| * recoverd: Print banning message only after verifying pnnAmitay Isaacs2013-06-281-2/+2
| | | | | | | | | | | | Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 4be8dff3a4451192f838497b4747273685959bed)
| * recoverd: When updating flags on nodes, send updated flags and not old flagsAmitay Isaacs2013-06-281-1/+1
| | | | | | | | | | | | | | | | | | | | | | This was broken by commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa. Instead of a SRVID_SET_NODE_FLAGS message to recovery daemon, a control was sent to the local daemon which in turn informed the recovery daemon. And while doing this change old flags were sent via CONTROL_MODIFY_FLAGS. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 7eb2f89979360b6cc98ca9b17c48310277fa89fc)
| * tools/ctdb: Add "force" option to "recover" commandMartin Schwenke2013-06-281-2/+9
| | | | | | | | | | | | | | | | | | | | At the moment there is no easy way to force a recovery when attempting to reproduce certain classes of bugs. This option is added without documentation because it is dangerous until the bugs are fixed! :-) Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 4f87925a287f612a6ab3b5da1a387a31c7bea28f)
| * client: Exit with non-zero status when unix socket is closedAmitay Isaacs2013-06-251-2/+2
| | | | | | | | | | | | Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 733fc909425860f6a02c205c2d8f34a731853922)
| * doc: Fix ctdb ping entry in manpageMartin Schwenke2013-06-221-2/+3
| | | | | | | | | | | | Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit abeb65ef02d018a7c14d4f8cea71e15c6cf9e357)
| * doc: Fix documentation for NoIPTakeover in ctdbd manpageMartin Schwenke2013-06-221-3/+4
| | | | | | | | | | | | Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 5d0215be5aefe492258a92c7bff2d41960379580)
| * doc: Update notification script section in ctdbd manpageMartin Schwenke2013-06-221-1/+6
| | | | | | | | | | | | | | | | The example notification script is now much more useful. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 4ba7c73eeab98296c9168e0b0fed1f6bb9f32733)
| * doc: Add nodestatus command to the ctdb manpageMartin Schwenke2013-06-221-0/+65
| | | | | | | | | | | | Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 4369c8e6ead9062ef7855ada375df74262acf925)
| * doc: Update NEWSMartin Schwenke2013-06-221-0/+26
| | | | | | | | | | | | Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit cd6227aa38d3bb4e5043faeffe436004e27b6d06)
| * tests: Integration tests use "ctdb nodestatus" for healthy cluster checkMartin Schwenke2013-06-221-15/+2
| | | | | | | | | | | | | | | | Also check that we're not in recovery mode. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit b7aaa28b3a6a2de923417f3d143f8d516447711e)
| * tests: Integration test infrastructure should do only a single recoveryMartin Schwenke2013-06-221-4/+0
| | | | | | | | | | | | | | | | No need for 2 recoveries after a restart. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit b953524185632d7f96a76d8f3bbed7ac1d143d40)
| * ctdbd: Fix panic on overlapping shutdownsMartin Schwenke2013-06-223-0/+9
| | | | | | | | | | | | | | | | | | | | The runstate can't be set to SHUTDOWN twice, so the current naive code causes a panic on the 2nd shutdown. This regression was introduced in commit 8076773a9924dcf8aff16f7d96b2b9ac383ecc28. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit f1b7ca8dc3f34a59c7b3e55748f974ac9ed8f458)
| * ctdbd: Refactor shutdown sequenceMartin Schwenke2013-06-224-21/+22
| | | | | | | | | | | | Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit b32fd04bfbf33062d45365b37a7247e272a76ceb)
| * eventscripts: "setup" event doesn't need to wait for SETUP runstateMartin Schwenke2013-06-201-15/+0
| | | | | | | | | | | | | | | | | | The "setup" event isn't called until ctdbd is in CTDB_RUNSTATE_SETUP anyway... Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 9ea57af557028b1d2e5c560e7bcf4d014b9a8b1e)
| * tests/eventscripts: New tests for 00.ctdb "init" eventMartin Schwenke2013-06-2016-6/+258
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | These test dropping of IPs and TDB checking. New stubs for date, tdbdump, tdbtool. Enhance ip stub to handle "ip addr show to ..." Tweak some infrastructure. Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit aabf0bf41cb8ec344f06b69492fb6c2a27f9e900)
| * eventscripts: 13.per_ip_routing should not try hard to find public_addressesMartin Schwenke2013-06-201-1/+1
| | | | | | | | | | | | | | | | | | | | | | | | | | This essentially reverts d4621277240721e6d130a930b0100506b64467ea. This was added for testing but the test code was actually broken. CTDB itself will only process public IPs if $CTDB_PUBLIC_ADDRESSES is set, so no code should try to be more flexible than that! The test code has been fixed instead. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 3b11b27f3e22e99947bc2d6c49c4427bd7a0e332)
| * tests/eventscripts: setup_ctdb() should always set $CTDB_PUBLIC_ADDRESSESMartin Schwenke2013-06-201-1/+2
| | | | | | | | | | | | | | Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit c3e7a6e10d486ba0dbafdf110db540675b2317bc)
| * logging: Notify parent when logging daemon is upMartin Schwenke2013-06-201-7/+33
| | | | | | | | | | | | | | | | | | | | | | | | Messages are lost until it is really up because syslogd_is_started is set too early. Adding a pipe to do the notification allows the parent to wait and only set syslogd_is_started when the logging daemon is actually ready. Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit f3dd2eec200d6eeada2ea19cd7e76f1edfad6167)
| * scripts: Move TDB checking from initscript to "init" eventMartin Schwenke2013-06-202-98/+95
| | | | | | | | | | | | | | | | | | It makes sense to do this in the "init" event and make the initscript less complicated. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 3bc93f312b8464fbfa2b2c44fffedc591fe5a3e0)
| * scripts: Move dropping of all IPs from initscript to "init" eventMartin Schwenke2013-06-202-4/+4
| | | | | | | | | | | | | | | | | | It makes sense to do this in the "init" event and make the initscript less complicated. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 0b77cceb49a30a181063adc7868d42d2851318e8)
| * scripts: drop_ip() should use delete_ip_from_iface()Martin Schwenke2013-06-201-1/+1
| | | | | | | | | | | | | | | | | | Otherwise secondary addresses that aren't owned by CTDB could be dropped. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 5ffce65a1ad659b198ddf647622b899bdde45c72)
| * scripts: drop_all_public_ips() now prints messages to stdout, not logMartin Schwenke2013-06-203-11/+5
| | | | | | | | | | | | | | | | Change all callers to maintain current behaviour. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 0b67397ef5419c781a35916575151da7b7e7cc27)
| * ctdbd: "init" event should run earlier in daemon initialisationMartin Schwenke2013-06-201-7/+7
| | | | | | | | | | | | | | | | | | | | | | | | | | It should run before: * the transport is started; * databases are attached; and * processing configuration files (e.g. nodes, public_addresses). Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 0a0c8543f167e11b75a622513367b083e42cbd3f)
| * tools/ctdb: Do not exit prematurely on control timeout if retrying in a loopAmitay Isaacs2013-06-201-5/+4
| | | | | | | | | | | | | | | | | | | | | | | | This avoids premature exits from "ctdb stop" and "ctdb continue" due to intermittent control (e.g. getpnn, getnodemap) timeouts. This needs a proper fix to distinguish between timeout and failure conditions and take appropriate action. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit c48583fd238496a81ddc46a21892f0b49559036a)
| * packaging: Update the minimum required library versionsAmitay Isaacs2013-06-171-3/+3
| | | | | | | | | | | | Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 5f8547b1531bba4950b3d873a997585c3a16d31e)
| * build: Enable VERBOSE option to display build command lineAmitay Isaacs2013-06-141-39/+47
| | | | | | | | | | | | | | | | make V=1 or make VERBOSE=1 will display build commands. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 02c63c591cc273122b3a547bb301b92f0e4bd217)
| * build: Fix tdb.h path to enable building with system TDB libraryMathieu Parent2013-06-1424-24/+24
| | | | | | | | (This used to be ctdb commit f8bf99de3a5f56be67aaa67ed836458b1cf73e86)
| * libctdb: Include config.h in libctdb/ctdb.cMathieu Parent2013-06-141-0/+1
| | | | | | | | | | | | Bug-Debian: http://bugs.debian.org/703551 (This used to be ctdb commit 14a79c0f3967c88f8ffc8200d122f6c5ffdb63a8)
| * ctdbd: Make sure we don't kill init process by mistakeAmitay Isaacs2013-06-141-1/+8
| | | | | | | | | | | | | | | | | | | | | | If getpgrp() fails, it will return -1 and that will send KILL signal to init process (PID 1). This does not happen on RHEL, but does on AIX. Reported-by: Chris Cowan <cc@us.ibm.com> Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit edb2a3556d03e248b42f63dd2c62382b723bc98f)
| * tests/eventscripts: Unit tests for $CTDB_NFS_DUMP_STUCK_THREADSMartin Schwenke2013-06-145-1/+63
| | | | | | | | | | | | | | | | Includes minor test infrastructure updates. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit cd4358b01c6c3d413b431f5760029d2b163b9c03)
| * tests/eventscripts: Fix -X tracing in iterate_test()Martin Schwenke2013-06-141-6/+5
| | | | | | | | | | | | | | | | ... and delete a bogus comment. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 0e2b5a8f89440a53f996482ac0c98b31a4f2cad3)
| * tests/eventscripts: Add unit tests for $CTDB_MONITOR_NFS_THREAD_COUNTMartin Schwenke2013-06-145-0/+53
| | | | | | | | | | | | | | | | Includes minor test infrastructure updates. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit ce2ef2be8aa22c0baf868daac8d4cf27246baa14)
| * eventscripts: New configuration varable $CTDB_NFS_DUMP_STUCK_THREADSMartin Schwenke2013-06-142-0/+30
| | | | | | | | | | | | | | | | | | | | | | If some nfsd threads are still alive after a shutdown during a restart then this indicates the maximum number of threads for which a stack trace should be dumped. This can be useful for trying to determine why nfsd is stuck. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 2503245db10d567af708a04edd3a3b488c24f401)
| * eventscripts: Add new option $CTDB_MONITOR_NFS_THREAD_COUNTMartin Schwenke2013-06-132-0/+38
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Consider the following example: 1. There are 256 nfsd threads configured. 2. 200 threads are "stuck" in system calls, perhaps waiting for the underlying filesystem when an attempt is made to restart NFS. 3. 56 threads exit when NFS is stopped. 4. 56 new threads are started when NFS is started. 5. 200 "stuck" threads exit leaving only 56 threads running. Setting this option to "yes" makes the 60.nfs monitor event look for this situation and try to correct it. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 99b0d8b8ecc36dfc493775b9ebced54539c182d2)
| * recoverd: Log node that causes takoever run to failMartin Schwenke2013-06-131-7/+11
| | | | | | | | | | | | | | | | | | | | | | | | Extend takeover_fail_callback() to just log (and not do any ban processing) when the callback data is NULL. Always call ctdb_takeover_run() with the callback so that useful errors are always logged. Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit c429394afbabaee09f9216dc743419adddf523ea)
| * doc: Add release notes for 2.2Martin Schwenke2013-05-301-0/+65
| | | | | | | | | | | | Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit ac0892d3a57adb0587a37de0f94fa686bed8970f)
| * build: Fix extra whitespacesAmitay Isaacs2013-05-291-7/+7
| | | | | | | | | | | | Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 78cff9d54f241fb6a2943e50346f9c2ad9decc78)
| * tevent: Sync to tevent 0.9.18 from upstreamAmitay Isaacs2013-05-2928-726/+3782
| | | | | | | | (This used to be ctdb commit 82d61f77c01df0fbb42743593937b175ce22a445)
| * replace: Sync to latest replace from upstreamAmitay Isaacs2013-05-2941-4745/+1599
| | | | | | | | | | | | | | | | | | | | | | | | | | The latest commits affecting lib/replace remove autoconf build from Samba tree. So using following commit as a sync point. commit 9ddfd7d8784e6f546628f48990b69ee2850be52d Author: Andrew Bartlett <abartlet@samba.org> Date: Wed May 22 17:23:30 2013 +1000 Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 506b27c944b4031e8a325816bd12abddd442a0bb)
| * tdb: Sync to tdb 1.2.11 from upstreamAmitay Isaacs2013-05-2965-47/+6474
| | | | | | | | (This used to be ctdb commit bb3a32ec055432afc7225c9fd7504fb187694bda)
| * talloc: Sync to talloc 2.0.8 from upstreamAmitay Isaacs2013-05-2938-309/+3034
| | | | | | | | | | | | Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 3bffca8c17e441364525df115ee2ac16b5969e24)
| * ctdbd: Log node state transitions at higher debug levelAmitay Isaacs2013-05-291-2/+2
| | | | | | | | | | | | Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit db31dc48bd3135e9242af08bb79b67a17a2b1668)
| * git: Ignore generated ctdb.spec fileAmitay Isaacs2013-05-291-0/+1
| | | | | | | | | | | | Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit ca7ba26362eabfbcc329c66919d9c4da79c3b799)
| * git: Ignore ctdb_version.h fileAmitay Isaacs2013-05-291-1/+1
| | | | | | | | | | | | Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 641f539ffc7dd9542e669a3ec20c004f8bbcbf1e)
| * build: Use REPLACE_OBJ and CTDB_EXTERNAL_OBJ to simplify build rulesAmitay Isaacs2013-05-291-9/+13
| | | | | | | | | | | | | | | | | | This fixes the build on AIX where libreplace is required to build ctdb_lock_helper, ctdb_fetch_lock_once, ctdb_fetch_readonly_once. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit fa757b49374e44c2380d4457e9b0eb3582981fac)
| * build: Support for building on AIX xlc compilerAmitay Isaacs2013-05-291-2/+6
| | | | | | | | | | | | | | | | xlc does not support -fPIC, -Wno-format-zero-length Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 2cf95741fdab2ee5f724950a0b1ef257d6aeade7)
| * tests: Do not use err() to support AIXAmitay Isaacs2013-05-291-4/+6
| | | | | | | | | | | | Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 1734562a7b3512853b9e0232880c42d50c1c2e4c)
| * tests: Include system/time.h to support building on AIXAmitay Isaacs2013-05-293-7/+2
| | | | | | | | | | | | Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 0320bb4f8ca8171812ec7f41556aed847c74bfb4)
| * libctdb: Do not include sys/time.h to support build on AIXAmitay Isaacs2013-05-295-0/+6
| | | | | | | | | | | | Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 2c19fa78ce0b25c3615b23664df32233bdbdea42)
| * util: Do not stop build if backtracing is not supportedAmitay Isaacs2013-05-291-2/+1
| | | | | | | | | | | | Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit b091f09ea01482823bd850d1d4e2329e0a19c959)