Details
- Type: Bug
- Status: Closed
- Priority: Critical
- Resolution: Solved By
- Affects Version/s: 12.1
- Fix Version/s: None
- Component/s: XML
- Labels:
- Tests: Unit
- Development Priority: High
- Difficulty: Unknown
- Documentation: N/A
- Documentation in Release Notes: N/A
- Similar issues:
Description
The following test added to DefaultHTMLCleanerTest shows the problem:
    @Test
    public void preserveDoubleEscapingInAttributes() throws Exception
    {
        assertHTML("<div foo=\"&amp;quot;\">content</div>", "<div foo=\"&amp;quot;\">content</div>");
    }
The HTML cleaner should preserve double XML encoding in attribute values (the use case being attributes whose value is itself XML). Right now the following:
    <div foo="&amp;quot;">content</div>
ends up as:
    <div foo="&quot;">content</div>
which breaks any code that later tries to parse the XML from the attribute value.
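As a rough illustration of why a lost level of escaping matters, the sketch below uses the JDK's standard DOM parser, not the HTML cleaner itself. An attribute written as `&amp;quot;` decodes to the text `&quot;`, which is still escaped XML, whereas an attribute written as `&quot;` decodes to a bare quote character. The class name `DoubleEscapingDemo` and the helper `fooValue` are hypothetical and exist only for this example.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

public class DoubleEscapingDemo {
    /** Parses the markup and returns the decoded value of the root element's "foo" attribute. */
    static String fooValue(String markup) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(markup)));
            return doc.getDocumentElement().getAttribute("foo");
        } catch (Exception e) {
            // Wrap checked parser exceptions so callers need no throws clause.
            throw new RuntimeException(e);
        }
    }

    public static void main(String[] args) {
        // Double-escaped attribute: one round of decoding leaves an XML-escaped quote.
        System.out.println(fooValue("<div foo=\"&amp;quot;\">content</div>"));
        // Single-escaped attribute: the decoded value is a bare quote character.
        System.out.println(fooValue("<div foo=\"&quot;\">content</div>"));
    }
}
```

In the first case, a consumer that treats the attribute value as XML still receives escaped content. In the second case it receives a raw quote, so the embedded XML no longer means what its author intended.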
Issue Links
- depends on: XCOMMONS-1938 Upgrade to HtmlCleaner 2.24 (Closed)
- relates to: XCOMMONS-1740 Upgrade to HTML Cleaner 2.23 (Closed)