
Re: [Libguestfs] P2Vs seem to require a very robust Ethernet



> Greg, is this with Matt's latest F16 packages?

No.  Not yet.  I only wanted to change one thing at a time, so right now I'm still using Matt's special 0.8.4.1 from a few days ago.  But now I think I need to try again with 0.8.5 (I think that's the version).

Trying to import that VM into RHEV keeps failing.  Every time I start an import, RHEV fails it about 10 minutes in.  Then both my Export domain and ISO domain go offline.  Curiously, the RHEV Export domain is now on that Fedora system and the ISO domain is still on the Storagetek.  But they both go offline every time I start a RHEV import.

This gets even better.  I have 2 hosts named thing1 and thing2.  Host thing1 was the SPM (Storage Pool Manager), but it apparently went offline.  Well, sort of.  Host thing1 had several VMs and I successfully migrated all of them to host thing2.  Then I rebooted thing1.

Both hosts are HP ProLiants and I can get at their consoles via iLO.  Connected to thing1 via iLO, I find it can no longer ping anywhere.  When thing2 tries to ping thing1 and I watch on thing1 with tcpdump, I see the echo requests come in on bridge device rhevm and on physical eth0.  But thing1 never sends any replies.  When I listen in general with tcpdump on eth0, thing1 goes into PROMISC mode and I see traffic from all over the LAN.  But thing1 never sends anything out.  It listens but apparently won't talk.

Well, OK, so maybe I have some kind of hardware problem with thing1. Bad motherboard?

But now host thing2 has all the VMs and the SPM role.  I should be able to do a RHEV import using thing2 even if thing1 has a problem.  But this consistently fails after about 10 minutes when RHEV-M logs a failure message and takes both the ISO and Export domains offline.  

Starting up a RHEV import and watching with tcpdump on thing2 - this is strange. Here is a sample.  Thing2 is 175.10.0.62 and the Fedora NFS server is 175.10.0.95. I wonder what ERR 1448 means?
.
.
.

09:51:05.213385 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 53958549 win 24571 <nop,nop,timestamp 1083651691 57298290>
09:51:05.213501 IP 175.10.0.95.2049 > 175.10.0.62.1600941934: reply ERR 1448
09:51:05.213626 IP 175.10.0.95.2049 > 175.10.0.62.92276579: reply ERR 1448
09:51:05.213632 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 53961445 win 24571 <nop,nop,timestamp 1083651692 57298290>
09:51:05.213748 IP 175.10.0.95.2049 > 175.10.0.62.1969553664: reply ERR 1448
09:51:05.213871 IP 175.10.0.95.2049 > 175.10.0.62.2675744636: reply ERR 1448
09:51:05.213878 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 53964341 win 24571 <nop,nop,timestamp 1083651692 57298290>
09:51:05.213994 IP 175.10.0.95.2049 > 175.10.0.62.1366776504: reply ERR 1448
09:51:05.214117 IP 175.10.0.95.2049 > 175.10.0.62.1668246831: reply ERR 1448
09:51:05.214124 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 53967237 win 24571 <nop,nop,timestamp 1083651692 57298290>
09:51:05.214240 IP 175.10.0.95.2049 > 175.10.0.62.1718183282: reply ERR 1448
09:51:05.214363 IP 175.10.0.95.2049 > 175.10.0.62.1997868874: reply ERR 1448
09:51:05.214374 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 53970133 win 24571 <nop,nop,timestamp 1083651692 57298291>
09:51:05.214485 IP 175.10.0.95.2049 > 175.10.0.62.1903301888: reply ERR 1448
09:51:05.214610 IP 175.10.0.95.2049 > 175.10.0.62.1254097601: reply ERR 1448
09:51:05.214617 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 53973029 win 24571 <nop,nop,timestamp 1083651693 57298291>
09:51:05.214733 IP 175.10.0.95.2049 > 175.10.0.62.16778812: reply ERR 1448
09:51:05.214856 IP 175.10.0.95.2049 > 175.10.0.62.524340: reply ERR 1448
09:51:05.214863 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 53975925 win 24571 <nop,nop,timestamp 1083651693 57298291>
09:51:05.214978 IP 175.10.0.95.2049 > 175.10.0.62.2802450528: reply ERR 1448
09:51:05.215102 IP 175.10.0.95.2049 > 175.10.0.62.1952800512: reply ERR 1448
09:51:05.215108 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 53978821 win 24571 <nop,nop,timestamp 1083651693 57298291>
09:51:05.215224 IP 175.10.0.95.2049 > 175.10.0.62.3376141: reply ERR 1448
09:51:05.215347 IP 175.10.0.95.2049 > 175.10.0.62.201329664: reply ERR 1448
09:51:05.215355 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 53981717 win 24571 <nop,nop,timestamp 1083651693 57298291>
09:51:05.215470 IP 175.10.0.95.2049 > 175.10.0.62.1176568680: reply ERR 1448
09:51:05.215594 IP 175.10.0.95.2049 > 175.10.0.62.3541536: reply ERR 1448
09:51:05.215601 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 53984613 win 24571 <nop,nop,timestamp 1083651694 57298292>
09:51:05.215717 IP 175.10.0.95.2049 > 175.10.0.62.3758274048: reply ERR 1448
09:51:05.215840 IP 175.10.0.95.2049 > 175.10.0.62.570522807: reply ERR 1448
09:51:05.215847 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 53987509 win 24571 <nop,nop,timestamp 1083651694 57298292>
09:51:05.215962 IP 175.10.0.95.2049 > 175.10.0.62.2183504323: reply ERR 1448
09:51:05.216085 IP 175.10.0.95.2049 > 175.10.0.62.117456135: reply ERR 1448
09:51:05.216092 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 53990405 win 24571 <nop,nop,timestamp 1083651694 57298292>
09:51:05.216209 IP 175.10.0.95.2049 > 175.10.0.62.41447938: reply ERR 1448
09:51:05.216332 IP 175.10.0.95.2049 > 175.10.0.62.74776836: reply ERR 1448
09:51:05.216341 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 53993301 win 24571 <nop,nop,timestamp 1083651694 57298292>
09:51:05.216454 IP 175.10.0.95.2049 > 175.10.0.62.646972160: reply ERR 1448
09:51:05.216579 IP 175.10.0.95.2049 > 175.10.0.62.16784333: reply ERR 1448
09:51:05.216586 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 53996197 win 24571 <nop,nop,timestamp 1083651695 57298293>
09:51:05.216701 IP 175.10.0.95.2049 > 175.10.0.62.1678844834: reply ERR 1448
09:51:05.216824 IP 175.10.0.95.2049 > 175.10.0.62.716701787: reply ERR 1448
09:51:05.216831 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 53999093 win 24571 <nop,nop,timestamp 1083651695 57298293>
09:51:05.216947 IP 175.10.0.95.2049 > 175.10.0.62.1331588799: reply ERR 1448
09:51:05.217070 IP 175.10.0.95.2049 > 175.10.0.62.2164270270: reply ERR 1448
09:51:05.217077 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 54001989 win 24571 <nop,nop,timestamp 1083651695 57298293>
09:51:05.217193 IP 175.10.0.95.2049 > 175.10.0.62.7307520: reply ERR 1448
09:51:05.217315 IP 175.10.0.95.2049 > 175.10.0.62.201331456: reply ERR 1448
09:51:05.217323 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 54004885 win 24571 <nop,nop,timestamp 1083651695 57298293>
09:51:05.217438 IP 175.10.0.95.2049 > 175.10.0.62.891126785: reply ERR 1448
09:51:05.217563 IP 175.10.0.95.2049 > 175.10.0.62.655421: reply ERR 1448
09:51:05.217570 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 54007781 win 24571 <nop,nop,timestamp 1083651696 57298293>
09:51:05.217685 IP 175.10.0.95.2049 > 175.10.0.62.1115904: reply ERR 1448
09:51:05.217808 IP 175.10.0.95.2049 > 175.10.0.62.1810654585: reply ERR 1448
09:51:05.217815 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 54010677 win 24571 <nop,nop,timestamp 1083651696 57298294>
09:51:05.217935 IP 175.10.0.95.2049 > 175.10.0.62.3817660743: reply ERR 1448
09:51:05.218058 IP 175.10.0.95.2049 > 175.10.0.62.141783552: reply ERR 1448
09:51:05.218064 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 54013573 win 24571 <nop,nop,timestamp 1083651696 57298294>
09:51:05.218182 IP 175.10.0.95.2049 > 175.10.0.62.101093888: reply ERR 1448
09:51:05.218304 IP 175.10.0.95.2049 > 175.10.0.62.11927648: reply ERR 1448
09:51:05.218315 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 54016469 win 24571 <nop,nop,timestamp 1083651696 57298294>
09:51:05.218426 IP 175.10.0.95.2049 > 175.10.0.62.348889607: reply ERR 1448
09:51:05.218554 IP 175.10.0.95.2049 > 175.10.0.62.101450317: reply ERR 1448
09:51:05.218561 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 54019365 win 24571 <nop,nop,timestamp 1083651697 57298294>
09:51:05.218677 IP 175.10.0.95.2049 > 175.10.0.62.2810254248: reply ERR 1448
09:51:05.218800 IP 175.10.0.95.2049 > 175.10.0.62.841483264: reply ERR 1448
09:51:05.218807 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 54022261 win 24571 <nop,nop,timestamp 1083651697 57298295>
09:51:05.218923 IP 175.10.0.95.2049 > 175.10.0.62.43745795: reply ERR 1448
09:51:05.219046 IP 175.10.0.95.2049 > 175.10.0.62.1930551515: reply ERR 1448
09:51:05.219054 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 54025157 win 24571 <nop,nop,timestamp 1083651697 57298295>
09:51:05.219168 IP 175.10.0.95.2049 > 175.10.0.62.617053963: reply ERR 1448
09:51:05.219292 IP 175.10.0.95.2049 > 175.10.0.62.159516990: reply ERR 1448
09:51:05.219299 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 54028053 win 24571 <nop,nop,timestamp 1083651697 57298295>
09:51:05.219414 IP 175.10.0.95.2049 > 175.10.0.62.2390542669: reply ERR 1448
09:51:05.219539 IP 175.10.0.95.2049 > 175.10.0.62.70784577: reply ERR 1448
09:51:05.219546 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 54030949 win 24571 <nop,nop,timestamp 1083651698 57298295>
09:51:05.219661 IP 175.10.0.95.2049 > 175.10.0.62.2399374851: reply ERR 1448
09:51:05.219791 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 54033845 win 24571 <nop,nop,timestamp 1083651698 57298296>
09:51:05.220037 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 54036741 win 24571 <nop,nop,timestamp 1083651698 57298296>
09:51:05.471134 IP 175.10.0.95.2049 > 175.10.0.62.1019: . ack 500424 win 2895 <nop,nop,timestamp 57298695 1083651909>

19667 packets captured
60835 packets received by filter
41168 packets dropped by kernel
[root@thing2 ~]#
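
One possible reading of the capture, offered as a sketch and not as a confirmed diagnosis: the 1448 may be a TCP payload length and not an NFS error code. tcpdump tries to decode every segment from port 2049 as the start of an RPC reply, and the middle segments of a large transfer tend to come out as "reply ERR <length>" with a nonsense number where the port should be. Assuming a 1500-byte MTU with no IP options, a segment carrying the TCP timestamp option seen above has room for 1500 - 20 - 20 - 12 = 1448 bytes. The Python below checks that guess against the ACK numbers in the trace and also works out how much of the traffic tcpdump itself missed. The names `SAMPLE`, `ack_deltas`, and `drop_fraction` are made up for this illustration.

```python
import re

# Three consecutive ACKs from thing2, copied from the capture above,
# with the two "reply ERR 1448" lines that sit between each pair.
SAMPLE = """\
09:51:05.213385 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 53958549 win 24571 <nop,nop,timestamp 1083651691 57298290>
09:51:05.213501 IP 175.10.0.95.2049 > 175.10.0.62.1600941934: reply ERR 1448
09:51:05.213626 IP 175.10.0.95.2049 > 175.10.0.62.92276579: reply ERR 1448
09:51:05.213632 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 53961445 win 24571 <nop,nop,timestamp 1083651692 57298290>
09:51:05.213748 IP 175.10.0.95.2049 > 175.10.0.62.1969553664: reply ERR 1448
09:51:05.213871 IP 175.10.0.95.2049 > 175.10.0.62.2675744636: reply ERR 1448
09:51:05.213878 IP 175.10.0.62.1019 > 175.10.0.95.2049: . ack 53964341 win 24571 <nop,nop,timestamp 1083651692 57298290>
"""

# Matches the pure-ACK lines ("." flag field followed by "ack N win").
ACK_RE = re.compile(r": \. ack (\d+) win")


def ack_deltas(text):
    """Return the differences between consecutive ACK numbers in the text."""
    acks = [int(m.group(1)) for m in ACK_RE.finditer(text)]
    return [later - earlier for earlier, later in zip(acks, acks[1:])]


def drop_fraction(received_by_filter, dropped_by_kernel):
    """Fraction of filtered packets that tcpdump never got to see."""
    return dropped_by_kernel / received_by_filter


if __name__ == "__main__":
    # Each ACK covers the two segments printed before it.
    for delta in ack_deltas(SAMPLE):
        print(delta, "bytes acknowledged =", delta // 1448, "x 1448")
    # Counters from the tcpdump summary at the end of the capture.
    print("dropped: {:.1%}".format(drop_fraction(60835, 41168)))
```

Each ACK advances by 2896 bytes, which is exactly two 1448-byte segments, so the data appears to be arriving and being acknowledged in order. The summary counters also show that tcpdump dropped roughly two thirds of the packets it was offered, so this trace is incomplete and should not be taken as a full picture of the NFS session.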



