Uploaded image for project: 'XWiki Platform'
  1. XWiki Platform
  2. XWIKI-5763

Remove entity encoding from UTF-8 text in XHTML

    XMLWordPrintable

Details

    • Improvement
    • Resolution: Fixed
    • Major
    • 2.7 RC1
    • 2.6
    • {Unused} Core
    • None
    • entity escape xhtml html utf utf8
    • Unknown

    Description

      Please, remove entity encoding if page is in UTF-8. Currently page body looks like this:

      <title>&#1055;&#1077;&#1088;&#1074;&#1099;&#1081; &#1087;&#1088;&#1086;&#1077;&#1082;&#1090; (Sandbox.&#1055;&#1077;&#1088;&#1074;&#1099;&#1081; &#1087;&#1088;&#1086;&#1077;&#1082;&#1090;) - XWiki</title>
      ...
      <meta name="document" content="Sandbox.&#1055;&#1077;&#1088;&#1074;&#1099;&#1081; &#1087;&#1088;&#1086;&#1077;&#1082;&#1090;"/>
      ...
      <meta name="page" content="&#1055;&#1077;&#1088;&#1074;&#1099;&#1081; &#1087;&#1088;&#1086;&#1077;&#1082;&#1090;"/>
      ...
      <span class="tmetype">Page: ...<strong>&#1055;&#1077;&#1088;&#1074;&#1099;&#1081; &#1087;&#1088;&#1086;&#1077;&#1082;&#1090;
      ...
      <div id="xwikicontent">
      <p>&#1055;&#1088;&#1086;&#1073;&#1072; &#1087;&#1077;&#1088;&#1072;....
      

      It makes page size three times bigger, complicates parsing. Also, some other parts of page use plain UTF-8 characters without entity encoding.

      Attachments

        Activity

          People

            sdumitriu Sergiu Dumitriu
            vanav Ivan Vanyushikn
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: