[Linux-cluster] gfs_controld[]: retrieve_plocks: ckpt open error 12 gfsmountpoint

Brett Cave brettcave at gmail.com
Tue Aug 19 08:14:24 UTC 2008


Seems like this is causing problems with the cluster - getting this on
1 node just before cluster hangs.
gfs_controld[]: retrieve_plocks: ckpt open error 12 gfs

The only reference i can find when googling this to plock.c
	rv = saCkptCheckpointOpen(ckpt_handle, &name, NULL,
				  SA_CKPT_CHECKPOINT_READ, 0, &h);
	if (rv == SA_AIS_ERR_TRY_AGAIN) {
		log_group(mg, "retrieve_plocks: ckpt open retry");
		sleep(1);
		goto open_retry;
	}
	if (rv != SA_AIS_OK) {
		log_error("retrieve_plocks: ckpt open error %d %s",
			  rv, mg->name);
		return;
	}

Not quite sure what CkptCheckpoint is, but from seeing the code from
ais, it seems to be some form of fault tolerance.
Found a post about a possible bug in the sackptCheckpointOpen function:
https://lists.linux-foundation.org/pipermail/openais/2006-September/008360.html


Have just installed newer versions of cman, gfs-utils, openais and
kmod-gfs, and upgraded kernel now, going to see if im still getting
hangs. been running for a few hours now with node resets and IO bursts
and seems to be behaving a little better.




More information about the Linux-cluster mailing list