This post originated from an RSS feed registered with Web Buzz
by Vineet Manohar.
Original Post: Access to www.w3.org DTDs blocked from Java
Feed Title: Vineet Manohar's blog » web 2.0
Feed URL: http://www.vineetmanohar.com/category/web20/feed/
Feed Description: Web 2.0: trends, tips and tricks! Google, Google analytics, Javascript, AJAX, SEO and more...
While parsing an XML file, I discovered that www.w3.org blocks requests for certain resources when they originate from Java programs, which it identifies by the User-Agent header. If your XML refers to w3.org DTDs, you might see an error like this: [java.io.IOException: Server returned HTTP response code: 500 for URL: http://www.w3.org/TR/xhtml1/DTD/xhtml-lat1.ent].
This is a known issue. The URLs have been deliberately blocked by w3.org due to 'abusive' use by Java programs.
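One common way to avoid the error is to stop the parser from fetching the DTD over the network at all. The sketch below is a minimal illustration, not a definitive fix: the class name OfflineDtdParser and the resolver SKIP_W3C are hypothetical names made up for this example. It installs a standard SAX EntityResolver that answers any w3.org system ID with an empty stream, so no HTTP request is made.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.xml.sax.EntityResolver;
import org.xml.sax.InputSource;

// Hypothetical helper class for illustration only.
class OfflineDtdParser {

    // Answers every external entity hosted on www.w3.org with an empty
    // stream, so the parser never opens a connection to that host.
    static final EntityResolver SKIP_W3C = (publicId, systemId) -> {
        if (systemId != null && systemId.startsWith("http://www.w3.org/")) {
            return new InputSource(new StringReader(""));
        }
        return null; // fall back to the parser's default resolution
    };

    // Parses an XML string with the resolver above installed.
    static Document parse(String xml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        DocumentBuilder builder = factory.newDocumentBuilder();
        builder.setEntityResolver(SKIP_W3C);
        return builder.parse(new InputSource(new StringReader(xml)));
    }

    public static void main(String[] args) throws Exception {
        String xml =
            "<!DOCTYPE html PUBLIC \"-//W3C//DTD XHTML 1.0 Strict//EN\" "
            + "\"http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd\">"
            + "<html><body>hello</body></html>";
        Document doc = parse(xml);
        System.out.println(doc.getDocumentElement().getNodeName());
    }
}
```

The trade-off is that the DTD content is discarded, so validation against it is not possible and named entities the DTD would define (for example, those in xhtml-lat1.ent) are no longer declared. If you need them, a resolver that returns a locally stored copy of the DTD files is likely the better choice.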