HTML to TXT converter (slightly special)

February 2nd, 2010, 06:24

Admittedly, I am a bit embarrassed by this request, but can anyone point out a FREE or PAID HTML to TXT converter?

Oh yes, I know there are milliards of them, but I have a special requirement: it should be able to IGNORE the <BR> tag, otherwise it breaks lines where it's not wanted. But it should NOT ignore the <P> tag, if you get what I mean.

Ah yes, I *could*, theoretically, use sr.exe to replace all <BR> with NOTHING and THEN run any HTML to TXT converter, but being quite lazy, I'd love it if that were BUILT IN, wot?
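
For anyone who does go the two-step route, the "replace all <BR> with NOTHING" pass could look roughly like the Python sketch below. This is only an illustration and not how sr.exe works; the names BR_TAG and drop_br_tags are made up for the example.

```python
import re

# Matches <br>, <BR>, <br/>, <br /> and <br clear="all"> style tags,
# but not other tags that merely start with "br".
BR_TAG = re.compile(r"<br\b[^>]*>", re.IGNORECASE)


def drop_br_tags(html: str) -> str:
    """Return the HTML with every <BR> tag replaced by nothing."""
    return BR_TAG.sub("", html)


if __name__ == "__main__":
    sample = "line one<BR>line two<br />line three<p>new paragraph</p>"
    print(drop_br_tags(sample))
```

Note that replacing with literally nothing glues together words that were separated only by the tag; substituting a single space instead may be the safer choice, depending on the pages.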

I'm looking at converting around 100K-120K .html pages...

Thank you in advance.

Have Phun

February 2nd, 2010, 20:38

Methinks you will not find a converter that lets you choose which tags to read and which to ignore.

Since I have had to spend a fair amount of time doing such things the hard way, I would gladly pay for such a program.


February 3rd, 2010, 01:08
Ideally, an open source PHP (or other) util would be nice. PHP has a strip_tags function with an allowable_tags parameter to specify tags which should not be stripped.
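
In the same "choose your tags" spirit, here is a rough Python sketch (standard library only) of a converter that turns <P> into a paragraph break and silently ignores <BR> and every other tag. It is not the PHP function and not any existing tool; TextExtractor, html_to_text and break_tags are hypothetical names, and it makes no attempt to skip <script> or <style> content.

```python
import re
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects text; only tags listed in break_tags produce a paragraph break."""

    def __init__(self, break_tags=("p",)):
        super().__init__(convert_charrefs=True)
        self.break_tags = set(break_tags)
        self.parts = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names, so <P> and <p> both arrive as "p".
        if tag in self.break_tags:
            self.parts.append("\n\n")

    def handle_endtag(self, tag):
        if tag in self.break_tags:
            self.parts.append("\n\n")

    def handle_data(self, data):
        # Collapse whitespace so line breaks in the HTML source
        # are not mistaken for paragraph breaks.
        self.parts.append(re.sub(r"\s+", " ", data))


def html_to_text(html, break_tags=("p",)):
    parser = TextExtractor(break_tags)
    parser.feed(html)
    parser.close()  # flush any buffered text
    chunks = "".join(parser.parts).split("\n\n")
    paragraphs = [" ".join(chunk.split()) for chunk in chunks]
    return "\n\n".join(p for p in paragraphs if p)


if __name__ == "__main__":
    print(html_to_text("<P>line one<BR>\nline two</P>\n<P>next paragraph</P>"))
```

Passing break_tags=("p", "br") would bring the <BR> line breaks back as paragraph breaks, so the set of honoured tags stays a one-line choice.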

Second to that, you could probably "convince" a simple converter to ignore <BR> tags. Hmm, I found a nice little "HTML to text freeware converter" that might fit the bill...from a reputable one-man company which has many other useful freeware utilities that you might recognize... donations accepted


push offset aBr ; "br"
push esi ; Str1
call _strcmpi
test eax, eax

February 3rd, 2010, 03:52
Thank you, Kay. The quoted phrase in your post was especially nice; it gave me the first hit in GOOG.

Have Phun

EDITED: around 10 minutes later...

And heck, it works too!

Kay, thanks. AGAIN!

Have Phun

