Re: [Libguestfs] hivexml - Flattened vs. Expanded XML

On Fri, Mar 19, 2010 at 01:45:34PM -0700, Simson Garfinkel wrote:
> Greetings. I am new to this mailing list. 
> We have been working with XML for digital forensics. One of the areas that we wish to create a schema for is the representation of registry entries.
> We are interested in hivexml as a tool for extracting the registry as an XML representation.

'hivexml' is really just a demo program that I wrote.  You can change
the XML format or even rewrite it -- the whole program is only 345
lines of code.


If you are interested in forensic analysis, it might be worth looking
at the analysis tools we wrote as well / instead:


These analysis tools examine the registry in much more detail and can
look for inconsistencies, hidden keys, etc., which we don't care so
much about in the main hivex library.

One issue that may be of concern is string encoding in registry
values, which is not well defined.  Naturally for XML I suppose you'd
want to represent string values as UTF-8.  However it's almost
impossible to know for sure how strings are encoded in the registry,
so doing this conversion would either involve a heuristic, or you'd
have to store binary blobs in the XML (encoded as Base64 or as hex
digits).  The registry is a mess in this respect.
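
To make the heuristic-plus-fallback idea concrete, here is a minimal
Python sketch.  It is not part of hivex or hivexml.  It assumes that
string values are usually UTF-16LE (common, but not guaranteed), and
the names encode_value and value_element, the "value" element and the
"encoding" attribute are all made up for this example.

```python
import base64
import xml.etree.ElementTree as ET
from typing import Tuple


def _xml_safe(text: str) -> bool:
    """True if every character is allowed in an XML 1.0 document."""
    return all(
        c in "\t\n\r" or (ord(c) >= 0x20 and ord(c) not in (0xFFFE, 0xFFFF))
        for c in text
    )


def encode_value(raw: bytes) -> Tuple[str, str]:
    """Return (encoding_label, text) for the raw bytes of a string value.

    Heuristic (an assumption, not a rule of the registry format):
    try strict UTF-16LE first; if that fails, or the result contains
    characters XML cannot carry, fall back to Base64 of the raw bytes.
    """
    fallback = ("base64", base64.b64encode(raw).decode("ascii"))
    try:
        text = raw.decode("utf-16-le")  # strict: raises on bad input
    except UnicodeDecodeError:
        return fallback
    text = text.rstrip("\x00")  # drop trailing NUL terminator(s)
    if not _xml_safe(text):
        # e.g. embedded NULs or control characters: keep the blob intact
        return fallback
    return ("utf-8", text)


def value_element(raw: bytes) -> ET.Element:
    """Build a hypothetical <value encoding="..."> element."""
    label, text = encode_value(raw)
    el = ET.Element("value", {"encoding": label})
    el.text = text
    return el
```

With this approach a consumer can always recover the original bytes
from the Base64 case, at the cost of some values being opaque when the
guess fails.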

> You can find an example of the digital forensics XML at:
> http://www.forensicswiki.org/wiki/Fiwalk

Looks interesting.  It should be easy to get libguestfs to write this
format for disk images.  There is already a (trivial) demo program I
wrote along those lines:


- - -

If you have changes for libguestfs or hivex, please submit them to
this mailing list, as you would for any open source project:



