Python Buzz Forum - New feature for the day

Articles |
News |
Weblogs |
Books |
Forums

Artima Forums | Articles | Weblogs | Java Answers | News

Sponsored Link •

Python Buzz Forum
New feature for the day

0 replies on 1 page.

Welcome Guest
Sign In

Back to Topic List

Reply to this Topic

Search Forum

Threaded View


Previous Topic		Next Topic

Flat View: This topic has 0 replies on 1 page

Ian Bicking

Posts: 900
Nickname: ianb
Registered: Apr, 2003

Ian Bicking is a freelance programmer

New feature for the day

Posted: Apr 14, 2004 10:06 AM

This post originated from an RSS feed registered with Python Buzz by Ian Bicking.
Original Post: New feature for the day Feed Title: Ian Bicking Feed URL: http://www.ianbicking.org/feeds/atom.xml Feed Description: Thoughts on Python and Programming.	Latest Python Buzz Posts Latest Python Buzz Posts by Ian Bicking Latest Posts From Ian Bicking

The new feature in the wiki for today is version comparisons.

This uses htmldiff.py to do the comparisons. Unlike some other comparisons, htmldiff calculates the differences between HTML documents, instead of relying on line-by-line comparisons of the original text source. Since HTML isn't (very) whitespace sensitive, comparisons based on line endings or other whitespace aren't really accurate. Instead htmldiff parses the HTML into a list of tokens -- one token for each start and end tag, and one token for each whitespace-delimited word in the text (it essentially ignores the nested structure of HTML and treats it as a simple stream of tokens).

This seems like a good compromise to me. Character level comparisons ignore the structure of HTML completely, and tend to create weird differences. Line level comparisons aren't appropriate to HTML or narrative text. Structured comparisons like XmlDiff are too complicated to present in a visually simple way.

Read: New feature for the day

Previous Topic

Next Topic


	Web Artima.com