samba.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
...
\| *	recoverd: New function do_takeover_run()	Martin Schwenke	2013-09-19	1	-21/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Factor the calling sequence for ctdb_takeover_run() into a new function and call it instead. This changes rec->need_takeover_run to false for each successful takeover run and that seems to be the right thing to do. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 9a3f0c0e61ca5c17e020c6e0463d73c7cf4f7c09)
\| *	recoverd: Stabilise the recovery master role	Martin Schwenke	2013-09-19	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	On rare occasions when a node that has been inactive it will trigger an election when it becomes active again. If that node has been up for the longest then it will win the election and the recovery master role will spuriously move. While a node remains inactive we reset the priority time to discourage it from winning elections. The priority time will now reflect roughly how long the node has been active rather than how long it has been up. That means the most stable node is more likely to win elections. Having a stable recovery master means that disabling takeover runs while reloading IPs is more likely to succeed. It also improves the chances of being able to cache information in the recovery master - for example, between takeover runs. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit f0f48f22f45e4c82eba2582efae307e25385de81)
\| *	recoverd: Banned nodes should not be told to run "ipreallocated" event	Martin Schwenke	2013-09-18	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	They will reject it because they are in recovery. This can result in extra banning credits being applied to banned nodes. This corresponds to commit 9132e6814ed927fa317f333f03dedb18f75d0e5b from the 1.2.40 branch. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 403938804caf1322f9773d63197e4303a7b2a788)
\| *	common: Make parse_ip() valgrind-clean	Martin Schwenke	2013-09-11	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit c0bb147ca09e82019b05ec22995623cffc3184e2)
\| *	recoverd: Remove an orphaned comment	Martin Schwenke	2013-09-11	1	-4/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This should have been removed with the associated code in commit 14bd0b6961ef1294e9cba74ce875386b7dfbf446. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 36de63843de10a1f2a9ccdbbee24cc1d08542984)
\| *	recoverd: Update a comment to use current terminology	Martin Schwenke	2013-09-11	1	-3/+4
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit ea5576071b22e1877903ec0921d375626a23e13b)
\| *	client: Remove unused function list_of_active_nodes_except_pnn()	Martin Schwenke	2013-09-11	2	-12/+0
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit d8a76cf79f07dfb5a93c6c9a13f16e3268c7dd57)
\| *	tools/ctdb: list_of_active_nodes_except_pnn() -> list_of_nodes()	Martin Schwenke	2013-09-11	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	list_of_active_nodes_except_pnn() is only used here and can be removed if we remove this call. Less is more... Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit d4e206fb818048b7fab4797c877b854bdbb1ab70)
\| *	tools/ctdb: Fix a memory leak in parse_nodestring()	Martin Schwenke	2013-09-11	1	-2/+3
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 8753a094b97340deb26dd44f6ea345ca0a642a95)
\| *	tests/eventscripts: Tests for memory checking in 00.ctdb	Martin Schwenke	2013-09-11	10	-2/+166
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	... plus updates to test infrastructure to support. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 4a388fc6bf54636b7e1f6da8e6aa451cddd574f7)
\| *	eventscripts: Clean up monitoring of system memory in 00.ctdb	Martin Schwenke	2013-09-11	1	-30/+41
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 16fcff0d1993b7a0479341862ea44d10bd5c6d6d)
\| *	server: standardize formatting of comment block for ctdb_reply_dmaster() ↵	Michael Adam	2013-08-26	1	-7/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	while I'm at it.. This was the comment block I was touching and meant to adapt in commit 00d3bf092e2f72eda330978c75ec85f17e870553. My search was apparently not unique... Signed-off-by: Michael Adam <obnox@samba.org> (This used to be ctdb commit 09940255011b119dc6af3304f5d3e9568e6006fd)
\| *	doc: Update NEWS	Martin Schwenke	2013-08-22	1	-0/+78
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Martin Schwenke <martin@meltin.net> Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit c446579fc442955ecc74f5566eaa0635c3171498)
\| *	build: Fix build dependencies for ctdb_lock_tdb	Amitay Isaacs	2013-08-22	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit eb8575718400c45626cd1b2e0fd247bc3ebff655)
\| *	tests/simple: Minimise the chance of a monitor event being cancelled	Martin Schwenke	2013-08-22	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	A monitor event following a "ctdb delip" might reconfigure services. If the monitor event is cancelled then a service might be stopped but not yet restarted and this could result in the subsequent monitor events failing. This obviously needs to be fixed in CTDB itself. This will happen by making "ctdb reloadips" the supported way of reconfiguring IPs. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 618ea3660e36e7bd92b686e1ca8728cf63c3c068)
\| *	packaging: Remove pushd/popd from maketarball.sh, don't need bash	Martin Schwenke	2013-08-22	1	-45/+34
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 3ffca990a18cbd31c8bd3ae01c6671d60da58f58)
\| *	tools/ctdb_diagnostics: Add output of "ctdb getdbmap"	Martin Schwenke	2013-08-22	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit f0d69a9079b7aecc68f1d2d8510702046b618b19)
\| *	tools/ctdb_diagnostics: Safer temporary file creation	Martin Schwenke	2013-08-22	1	-3/+10
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 406e1cb1fdd17ddd239774d0228e3657b73ae68f)
\| *	eventscripts: Avoid using a temporary file in 62.cnfs	Martin Schwenke	2013-08-22	1	-4/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 81833052d7ee8f76b1e98376a0273448640cfa8e)
\| *	scripts: Remove gdb_backtrace	Martin Schwenke	2013-08-22	1	-87/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This uses potentially insecure temporary files and is not referenced anywhere else. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 4b914d7e217202f3d11a8e95f9f74bc17869475b)
\| *	tools/ctdb: Make most non-auto-all commands abort if run with -n all	Martin Schwenke	2013-08-22	1	-6/+42
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Or if run with -n A,B,... Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit b1d8732b5da18ae80aea1df0e66b0b5cdcd919bc)
\| *	tools/ctdb: Remove more non-essential fetching of PNN from daemon	Martin Schwenke	2013-08-22	1	-25/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The useful cases are either CTDB_CURRENT_NODE, in which case ctdb_get_pnn() does the job, or a PNN, which is... ummm... a PNN! :-) This works because parse_nodestring() validates PNNs. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 7b3f7eea2465efb099a2faf3e42174bc97b13a16)
\| *	tools/ctdb: Improve auto-all settings for some commands	Martin Schwenke	2013-08-22	1	-8/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* ipreallocate is cluster-wide so should not be auto-all * enablescript, disablescript, getreclock, setreclock, natgwlist can all be auto-all without issues * xpnn, ipiface a local-only so don't work with -n, so might as well not be auto-all Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 123a4677528cb46bee1c6dad8a5162eba9880bc1)
\| *	recoverd: Remove an unused temporary talloc context	Martin Schwenke	2013-08-22	1	-3/+0
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit da22d5e60dc023009854025cc9e6bc4b0a84c60e)
\| *	recoverd: Move struct ctdb_public_ip_list back into ctdb_takeover.c	Martin Schwenke	2013-08-22	2	-6/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is an internal structure. It was moved into ctdb_private.h a long time ago to allow unit testing. Unit test compilation was changed shortly afterwards to make this unnecessary. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit db57261d7dc264e161659a8c547f44fbd9e88eeb)
\| *	recoverd: Log more information when interfaces change	Martin Schwenke	2013-08-22	1	-2/+15
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 3ef93a1a3e60cdf5d8954e7a16a988ea6126916b)
\| *	traverse: Log when database traverse is started	Amitay Isaacs	2013-08-22	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 256b157232c60bc432c94e54b1fae9699f737557)
\| *	ctdbd: Finish eventscript callback processing before debugging hung script	Amitay Isaacs	2013-08-22	1	-26/+47
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This ensures that the result of eventscripts is updated and callback is processed before debugging hung script. So "ctdb scriptstatus" output will be useful from debug hung script. Signed-off-by: Amitay Isaacs <amitay@gmail.com> Pair-Programmed-With: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 4ed2efb838d2ac97746666f614ebef5fdf3cdd5e)
\| *	ctdbd: Make sure call data is freed if doing an early return	Amitay Isaacs	2013-08-22	1	-2/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This should avoid memory bloat when a request bounces between nodes. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 7677fb263f06a97398e2c546e32273fb96edca69)
\| *	common/io: Limit the queue buffer size for fair scheduling via tevent	Amitay Isaacs	2013-08-22	1	-12/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If we process all the data available in a socket buffer, CTDB can stay busy processing lots of packets via immediate event mechanism in tevent. After processing an immediate event, tevent returns without epoll_wait. So as long as there are immediate events, tevent will never poll other FDs. CTDB will report this as "Event handling took xx seconds" warning. This is misleading since CTDB is very busy processing packets, but never gets to the point of polling FDs. The improvement in socket handling made it worse when handling traverse control. There were lots of packets filled in the socket buffer quickly and CTDB stayed busy processing those packets and not polling other FDs and timer events. This can lead to controls timing out and in worse case other nodes marking busy node as disconnected. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 92939c1178d04116d842708bc2d6a9c2950e36cc)
\| *	Revert "common/io: Keep queue buffer size multiple of 4K"	Amitay Isaacs	2013-08-22	1	-22/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This reverts commit 5e9b1a7e24d058ff88aaa0563db36a804e866fa9. This is not the best approach. Allowing queue buffer size to grow indefinitely causes large number of CTDB packets to be queued up very quickly which when processed via immediate events will block CTDB from processing events from other FDs. If there are immediate events queued up, tevent will never process any of the FDs till all immediate events are processed. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit d8b094e804efc53fae9f44c6ef961b7b5797d290)
\| *	Revert "LACOUNT: Add back lacount mechanism to defer migrating a ↵	Amitay Isaacs	2013-08-22	6	-29/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	fetched/read copy until after default of 20 consecutive requests from the same node" This reverts commit 035c0d981bde8c0eee8b3f24ba8e2dc817e5b504. This is a premature optimization. Record can bounce between nodes very quickly if it is a contended record. There is no need to hold a record on a node unnecessarily. In case record contention becomes bad, enabling sticky records on a database is a better idea. Conflicts: include/ctdb_private.h server/ctdb_tunables.c Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit ac417b0003f0116f116834ad2ac51482d25cfa0d)
\| *	ctdbd: Print a log message when a key becomes hot	Amitay Isaacs	2013-08-22	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 48f40985f4592c28402303ccbb458756f4914f75)
\| *	ctdbd: For volatile databases, write an empty record with rsn=0 only on dmaster	Amitay Isaacs	2013-08-22	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Empty record with rsn=0 should not be written on any other node other than dmaster. This is however not true for persistent databases. So currently apply the check only for volatile databases. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit df83ae7a047dab4803e0d94b1c11df48ae17ca96)
\| *	tools/ctdb: Fix message in showban when node is banned	Martin Schwenke	2013-08-21	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 5cdad2b8ebd71a5e458c301d00eac00a211feeb3)
\| *	tools/ctdb: Reimplement ban/unban using update_flags_wait_and_ipreallocate()	Martin Schwenke	2013-08-21	1	-61/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This has the side effect of making these commands more resilient to control timeouts. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 0fe79662e20e347d9e1cb12a42cd356e33572402)
\| *	tools/ctdb: Factor out common pattern used in disable/enable/stop/continue	Martin Schwenke	2013-08-21	1	-119/+76
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Now we will only have one set of bugs. :-) Signed-off-by: Martin Schwenke <martin@meltin.net> Pair-programmed-with: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 444521c852749558f39dc6131acce9e47eefd489)
\| *	tools/ctdb: Factor, simplify and improve robustness of ipreallocate code	Martin Schwenke	2013-08-21	1	-45/+32
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Having other functions call control_ipreallocate() suggests that the it might look at the argv/argv arguments that are passed. This is not the case. Change the callers so they call the new ipreallocate() function instead. Broadcast CTDB_SRVID_TAKEOVER_RUN to all connected nodes. Inactive nodes will ignore it. This is safe since we only want 1 reply. If we didn't get a response, we don't actually care if there's no active recovery master - just fire, wait, retry, ... Ignore some failures on the basis that they might be transient, so it is probably worth retrying. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit 4bf0b1c9d21986eecb7682f935bd6154c65533cc)
\| *	tools/ctdb: Use ctdb_get_pnn() to get PNN of the current node	Martin Schwenke	2013-08-21	1	-29/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This has already been stored at connect time and can't fail. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit d8eb2e7fdd7645719370dad4f2faa5c3fffa8249)
\| *	util: In passing the code, fix a space vs. tab in set_close_on_exec().	Michael Adam	2013-08-19	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Michael Adam <obnox@samba.org> (This used to be ctdb commit f9556a6f1fe0046308c8b363e6dcaf3f7ce6f2b7)
\| *	server: standardize formatting of comment block for ctdb_reply_dmaster() ↵	Michael Adam	2013-08-19	1	-6/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	while I'm at it.. Signed-off-by: Michael Adam <obnox@samba.org> (This used to be ctdb commit 00d3bf092e2f72eda330978c75ec85f17e870553)
\| *	server: fix wording and punctuation in comment block for ctdb_reply_dmaster().	Michael Adam	2013-08-19	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Michael Adam <obnox@samba.org> (This used to be ctdb commit cb3a1c5af3b796dba30cae07118670d3c9e57df7)
\| *	recoverd: Improve log message when nodes disagree on recmaster	Amitay Isaacs	2013-08-14	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 7b7aa7b599536cd60ebb84d363607bb4e953248a)
\| *	common: Null terminate process name string so valgrind doesn't complain	Amitay Isaacs	2013-08-14	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 1c9025fdd08d1cea342af7487d0123015e08831b)
\| *	vacuuming: Fix vacuuming bug where requests keep bouncing between nodes (part 2)	Amitay Isaacs	2013-08-14	1	-20/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is caused by corruption of a record header such that the records on two nodes point to each other as dmaster. This makes a request for that record bounce between nodes endlessly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit f0853013655ac3bedf1b793de128fb679c6db6c6)
\| *	vacuuming: Fix vacuuming bug where requests keep bouncing between nodes (part 1)	Amitay Isaacs	2013-08-14	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is caused by corruption of a record header such that the records on two nodes point to each other as dmaster. This makes a request for that record bounce between nodes endlessly. Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit a610bc351f0754c84c78c27d02f9a695e60c5b0f)
\| *	db_wrap: Make sure tdb messages are logged correctly	Amitay Isaacs	2013-08-14	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Amitay Isaacs <amitay@gmail.com> (This used to be ctdb commit 60cb40d090e45ff6134c098a238fac7ad854f134)
\| *	eventscripts: Become unhealthy faster on nfsd failure	Martin Schwenke	2013-08-14	4	-15/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Anecdotal evidence suggests that most nfsd RPC check failures are due to cluster filesystem or storage problem. Apparently these are rarely helped by attempting to restart the NFS service because the restart tends to hang. Fail after 2 nfsd RPC check failures, instead of waiting for 6 failures. Restart on every 10th failure to try to bring the node back to good health. Update unit tests to match. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit e9ef93f7b6dad59eabaa32124df81f3e74c651ef)
\| *	tools/ctdb: Increase default control timeout to 10 seconds	Martin Schwenke	2013-08-14	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The current 3 second timeout is arbitrary and users trip over it sometimes. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit b49c4f39666d5b1596213bf41bcdc47ed3c327ae)
\| *	eventscripts: Improve message logged when a counter hits a limit	Martin Schwenke	2013-08-14	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	It should print the actual number of consecutive failures rather than the limit. Signed-off-by: Martin Schwenke <martin@meltin.net> (This used to be ctdb commit ff5f0d1e29af2b293e30cdc54bed03a644be7038)