<?xml version="1.0" encoding="utf-8"?>
<?xml-stylesheet type="text/css" href="http://www.woodmann.com/collaborative/tools/skins/common/feed.css?97"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/">
	<channel>
		<title>Collaborative RCE Tool Library - Data Extraction Tools</title>
		<link>http://www.woodmann.com/collaborative/tools/index.php/Special:FeedListing/Data_Extraction_Tools/feed?feed_type=rss</link>
		<description>Update Notification Feed for Category: Data Extraction Tools</description>
		<language>en</language>
		<generator>MediaWiki 1.11.2 via dELTA feed generator</generator>
		<lastBuildDate>Fri, 03 Sep 2010 10:26:55 GMT</lastBuildDate>
		<item>
			<title>Tool Added: Pdf-parser</title>
			<link>http://www.woodmann.com/collaborative/tools/index.php/Pdf-parser</link>
			<description>&lt;P&gt;&lt;B&gt;Listed in categories:&lt;/B&gt;&amp;nbsp;&lt;I&gt;&lt;a href=&quot;http://www.woodmann.com/collaborative/tools/index.php/Category:Data_Extraction_Tools&quot;&gt;Data Extraction Tools&lt;/a&gt;&lt;/I&gt;&lt;/P&gt;&lt;p&gt;&lt;b&gt;Most recent version:&lt;/b&gt;&lt;br /&gt;
&lt;i&gt;0.3.1&lt;/i&gt;
&lt;/p&gt;&lt;p&gt;&lt;b&gt;Most recent release date:&lt;/b&gt;&lt;br /&gt;
&lt;i&gt;May 13, 2009&lt;/i&gt;
&lt;/p&gt;&lt;p&gt;&lt;b&gt;Description:&lt;/b&gt;&lt;br /&gt;
&lt;i&gt;This tool will parse a PDF document to identify the fundamental elements used in the analyzed file. It will not render a PDF document. The code of the parser is quick-and-dirty, I’m not recommending this as text book case for PDF parsers, but it gets the job done.&lt;br /&gt;&lt;br /&gt;The stats option display statistics of the objects found in the PDF document. Use this to identify PDF documents with unusual/unexpected objects, or to classify PDF documents. For example, I generated statistics for 2 malicious PDF files, and although they were very different in content and size, the statistics were identical, proving that they used the same attack vector and shared the same origin.&lt;br /&gt;&lt;br /&gt;The search option searches for a string in indirect objects (not inside the stream of indirect objects). The search is not case-sensitive, and is susceptible to the obfuscation techniques I documented (as I’ve yet to encounter these obfuscation techniques in the wild, I decided no to resort to canonicalization).&lt;br /&gt;&lt;br /&gt;filter option applies the filter(s) to the stream. For the moment, only FlateDecode is supported (e.g. zlib decompression).&lt;br /&gt;&lt;br /&gt;The raw option makes pdf-parser output raw data (e.g. not the printable Python representation).&lt;br /&gt;&lt;br /&gt;objects outputs the data of the indirect object which ID was specified. This ID is not version dependent. If more than one object have the same ID (disregarding the version), all these objects will be outputted.&lt;br /&gt;&lt;br /&gt;reference allows you to select all objects referencing the specified indirect object. This ID is not version dependent.&lt;br /&gt;&lt;br /&gt;type alows you to select all objects of a given type. The type is a Name and as such is case-sensitive and must start with a slash-character (/).&lt;/i&gt;
&lt;/p&gt;</description>
			<pubDate>Thu, 05 Nov 2009 01:26:57 GMT</pubDate>								</item>
		<item>
			<title>Tool Updated: Universal Extractor</title>
			<link>http://www.woodmann.com/collaborative/tools/index.php/Universal_Extractor</link>
			<description>&lt;P&gt;&lt;B&gt;Listed in categories:&lt;/B&gt;&amp;nbsp;&lt;I&gt;&lt;a href=&quot;http://www.woodmann.com/collaborative/tools/index.php/Category:Data_Extraction_Tools&quot;&gt;Data Extraction Tools&lt;/a&gt;&lt;/I&gt;&lt;/P&gt;&lt;p&gt;&lt;b&gt;Most recent version:&lt;/b&gt;&lt;br /&gt;
&lt;i&gt;1.6&lt;/i&gt;
&lt;/p&gt;&lt;p&gt;&lt;b&gt;Most recent release date:&lt;/b&gt;&lt;br /&gt;
&lt;i&gt;October 17, 2008&lt;/i&gt;
&lt;/p&gt;&lt;p&gt;&lt;b&gt;Description:&lt;/b&gt;&lt;br /&gt;
&lt;i&gt;Universal Extractor is a program do to exactly what it says: extract files from any type of archive, whether it's a simple zip file, an installation program, or even a Windows Installer (.msi) package.  This is still a work in progress (see details below), but so far it's proven quite useful and I feel others can also benefit from it.&lt;br /&gt;&lt;br /&gt;I should stress that this application is not intended to be a general purpose archiving program.  It will never replace WinRAR, 7-Zip, etc.  What it will do is allow you to extract files from virtually any type of archive, regardless of source, compression method, etc.  The original motivation behind this project was that I wanted an easy, convenient way to extract files from installation packages, such as Inno Setup or Windows Installer packages, without pulling up a command line every time.  In the process I got a little carried away and ended up throwing in support for every kind of archive format I possibly could find.&lt;/i&gt;
&lt;/p&gt;</description>
			<pubDate>Mon, 16 Mar 2009 17:36:34 GMT</pubDate>								</item>
		<item>
			<title>Tool Added: PDF Streams Inflater</title>
			<link>http://www.woodmann.com/collaborative/tools/index.php/PDF_Streams_Inflater</link>
			<description>&lt;P&gt;&lt;B&gt;Listed in categories:&lt;/B&gt;&amp;nbsp;&lt;I&gt;&lt;a href=&quot;http://www.woodmann.com/collaborative/tools/index.php/Category:Data_Extraction_Tools&quot;&gt;Data Extraction Tools&lt;/a&gt;&lt;/I&gt;&lt;/P&gt;&lt;p&gt;&lt;b&gt;Most recent version:&lt;/b&gt;&lt;br /&gt;

&lt;/p&gt;&lt;p&gt;&lt;b&gt;Most recent release date:&lt;/b&gt;&lt;br /&gt;
&lt;i&gt;September 24, 2008&lt;/i&gt;
&lt;/p&gt;&lt;p&gt;&lt;b&gt;Description:&lt;/b&gt;&lt;br /&gt;
&lt;i&gt;I've made this tool because PDF exploits (JavaScript inside PDF) are mostly hidden inside compressed streams.&lt;br /&gt;It will find and decompress zlib compressed streams. A new file per stream found will be created, where every such file will contain the decompressed stream content.&lt;br /&gt;&lt;br /&gt;Hope you will find it useful.&lt;br /&gt;&lt;br /&gt;I've made it in 3 hours (inclusive doing a research about streams in PDF files), so it may have some glitches and bugs. Please report your findings and/or suggestions.&lt;br /&gt;&lt;br /&gt;I have tested it on PDF files created by a couple of tools as there are some differences in created files between tools (different EOL, EOL after token identifier etc.)&lt;/i&gt;
&lt;/p&gt;</description>
			<pubDate>Mon, 29 Sep 2008 18:00:09 GMT</pubDate>								</item>
	</channel>
</rss>