September 28, 2006
Tips & tricks
Red Hat's customer service and support teams receive technical support questions from users all over the world. Red Hat technicians add the questions and answers to Red Hat Knowledgebase on a daily basis. Access to Red Hat Knowledgebase is free. Red Hat Magazine offers a preview into the Red Hat Knowledgebase by highlighting some of the most recent entries.
How do Red Hat Enterprise Linux 4 Update 4 and above support machine check exceptions (MCE) on the revision F AMD Opteron chip?
by Linda Wang
Red Hat Enterprise Linux 4 Update 4 supports the AMD Opteron Rev F MCE threshold counters.
Revision F of the AMD Opteron processor adds support for MCE threshold counters for DRAM. These counters allow a user with root access to specify how many correctable ECC errors the DRAM controller may report before an MCE is issued. This feature lets administrators of large server systems ignore infrequent ECC errors caused by cosmic radiation but be alerted via the MCE mechanism when a DRAM chip is failing.
Three major sections of note are:
- Sysfs Interfaces
With Update 4, the sysfs interface will be created under:
where [i] is the number of the CPU on which the threshold register is located and [j] is the number of the MCA bank to which the threshold register belongs.
There may be up to five banks per CPU; however, currently only the fifth bank, MC4_MISC, contains a valid threshold counter for DRAM ECC errors.
The following files will be created per valid threshold register:
error_count (R/W)
  - read: output the current error count in hex
  - write: reset the count
interrupt_enable (R/W)
  - read: output 1 if interrupt enabled, else 0
  - write: writing 0 will disable, non-0 will enable interrupts
threshold_limit (R/W)
  - read: output the current threshold limit in hex
  - write: set a new threshold limit
The interrupt_enable may be changed without affecting the error_count. The threshold_limit may be changed without affecting the error_count if the new limit is not below the current error count. The threshold_limit must be: 0x0 < limit < 0xFFF.
When the error_count reaches the threshold_limit, the error_count will be fixed at the threshold_limit and will not increment any longer.
The user must reset the error_count in order for the counting to resume.
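The counter rules above can be illustrated with a small software model. This is only a sketch in Python, not kernel code: the class and method names are hypothetical, and the behavior when a new limit falls below the current count is an assumption, since the text does not specify it.

```python
class ThresholdCounter:
    """Toy software model of one threshold register; not kernel code."""

    def __init__(self, threshold_limit):
        self.error_count = 0
        self.interrupt_enable = True
        self.threshold_limit = None
        self.set_limit(threshold_limit)

    def set_limit(self, limit):
        # The text requires 0x0 < limit < 0xFFF.
        if not 0x0 < limit < 0xFFF:
            raise ValueError("threshold_limit must satisfy 0x0 < limit < 0xFFF")
        if limit < self.error_count:
            # Assumption: the text does not say what happens to the count
            # in this case, so this model simply resets it.
            self.error_count = 0
        self.threshold_limit = limit

    def record_error(self):
        """Count one correctable error; return True if an interrupt would fire."""
        if self.error_count >= self.threshold_limit:
            return False  # counter is pinned at the limit until it is reset
        self.error_count += 1
        return self.interrupt_enable and self.error_count == self.threshold_limit

    def reset_count(self):
        """Model of writing to error_count: counting resumes from zero."""
        self.error_count = 0


counter = ThresholdCounter(threshold_limit=0x3)
fired = [counter.record_error() for _ in range(5)]
print(fired, hex(counter.error_count))  # [False, False, True, False, False] 0x3
```

In this model the third error reaches the limit and signals an interrupt; the fourth and fifth are not counted until reset_count() is called.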
- Threshold Interrupt
When the error_count reaches the threshold_limit and interrupt_enable is set, the processor generates an interrupt with THRESHOLD_APIC_VECTOR. The driver services the interrupt by simply logging an entry to the mcelog with a software-defined bank number.
The mcelog resides in /dev/mcelog and can be read directly by the user or by the user-land program 'mcelog', which decodes various machine check exception dumps.
Example #1: decode a fatal machine check exception message in ASCII to stdout
bash# mcelog --k8 --ascii
Example #2: redirect output to the system log
bash# mcelog --syslog --k8 /dev/mcelog
Refer to the mcelog man page for more detailed usage.
How do I find out if a given piece of PCI hardware is supposed to be taken care of by the current kernel?
by Pierre Amadio
Each PCI device is flagged with a constructor (vendor) identifier and a model identifier that together identify it. A list of such IDs is available on SourceForge. A kernel module can detect the hardware it should handle using those identifiers. A list of such IDs mapped to module names is available in the modules.pcimap file that comes with a given kernel (/lib/modules/`uname -r`/modules.pcimap).
The constructor and model IDs can be found with the lspci command. 'lspci' on its own lists the available hardware in human-readable form. 'lspci -n' gives the same list with the numeric constructor and model IDs. Locate the piece of hardware to be checked in the output of lspci, then find the line with the same PCI address (the leftmost column) in the output of 'lspci -n'. If modules.pcimap contains a line with that constructor and model ID, chances are that this piece of hardware will be taken care of by the kernel.
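The lookup just described can be sketched in Python. This is a minimal illustration under stated assumptions, not a supported tool: the function names are hypothetical, the sample 'lspci -n' line and the modules.pcimap rows are invented in the usual column layouts (the revision number and both module names are made up), and rows that match on device class alone are ignored.

```python
import re

PCI_ANY_ID = 0xFFFFFFFF  # wildcard value that can appear in modules.pcimap

# "<slot> [Class ]<class>: <vendor>:<device> ..." as printed by 'lspci -n'
LSPCI_N_LINE = re.compile(
    r"^(?P<slot>\S+)\s+(?:Class\s+)?[0-9a-fA-F]{4}:\s+"
    r"(?P<vendor>[0-9a-fA-F]{4}):(?P<device>[0-9a-fA-F]{4})"
)


def parse_lspci_n(output):
    """Map each PCI slot (e.g. '02:01.0') to (vendor, device) as integers."""
    ids = {}
    for line in output.splitlines():
        match = LSPCI_N_LINE.match(line.strip())
        if match:
            ids[match.group("slot")] = (
                int(match.group("vendor"), 16),
                int(match.group("device"), 16),
            )
    return ids


def find_pci_modules(pcimap_text, vendor, device):
    """Return the modules whose pcimap rows match the vendor and device IDs."""
    modules = []
    for line in pcimap_text.splitlines():
        fields = line.split()
        if len(fields) < 3 or fields[0].startswith("#"):
            continue  # skip blank lines, comments, and short rows
        try:
            row_vendor, row_device = int(fields[1], 16), int(fields[2], 16)
        except ValueError:
            continue  # not a data row
        # Simplification: the vendor must match exactly; the device may be
        # a wildcard. Rows that match by class only are not considered.
        if row_vendor == vendor and row_device in (device, PCI_ANY_ID):
            if fields[0] not in modules:
                modules.append(fields[0])
    return modules


# Invented sample data; the module names are hypothetical.
SAMPLE_LSPCI_N = "02:01.0 Class 0200: 8086:1079 (rev 03)\n"
SAMPLE_PCIMAP = """\
# pci module         vendor     device     subvendor  subdevice  class      class_mask driver_data
example_nic          0x00008086 0x00001079 0xffffffff 0xffffffff 0x00000000 0x00000000 0x0
example_other        0x00001234 0x00005678 0xffffffff 0xffffffff 0x00000000 0x00000000 0x0
"""

vendor_id, device_id = parse_lspci_n(SAMPLE_LSPCI_N)["02:01.0"]
print(find_pci_modules(SAMPLE_PCIMAP, vendor_id, device_id))  # ['example_nic']
```

On a real system, the text passed in would be the output of 'lspci -n' and the contents of the modules.pcimap file for the running kernel rather than the sample strings.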
Example Scenario: Will an Intel Corporation 82546GB Gigabit Ethernet Card be taken care of by a Red Hat Enterprise Linux 4 system running a 2.6.9-34.0.1.EL kernel?
lspci -n gives this result for the '02:01.0' device:
The constructor code is 8086 (Intel Corporation). The model is 1079.
The /lib/modules/2.6.9-34.0.1.EL/modules.pcimap file contains the following line that matches those numbers:
Why does one get the warning "Cannot Preserve Ownership Error when Files are Moved to an NFS Mount Point" while trying to move a file to an NFS mount point?
by Archana Raghavan
This happens because commands like cp and mv try to copy ACL information. The warning is reported even though there may not be any ACL information to transfer, because these programs cannot differentiate between a filesystem that does not support ACLs and one that supports them but has them disabled. The warning is harmless, and the file should be copied successfully.
Why am I getting SCSI errors when using device mapper multipathing with my MSA series and StorageWorks SAN?
by Josef Whiter
The MSA series and StorageWorks Storage Area Networks (SANs) are mostly active/passive SANs, which means that one path is always active and the other path does nothing until the active side fails. It is not possible to safely use device mapper multipathing on these SANs, because doing so requires a special driver that is not provided with Red Hat Enterprise Linux 4. The vendor has special firmware that changes the SAN to active/active, which allows it to work with device mapper multipathing. Please contact the vendor if you wish to use device mapper multipathing with this SAN.
The information provided in this article is for your information only. The origin of this information may be internal or external to Red Hat. While Red Hat attempts to verify the validity of this information before it is posted, Red Hat makes no express or implied claims to its validity.
This article is protected by the Open Publication License, V1.0 or later. Copyright © 2006 by Red Hat, Inc.