Uploaded image for project: 'XWiki Platform'
  1. XWiki Platform
  2. XWIKI-5763

Remove entity encoding from UTF-8 text in XHTML

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.6
    • Fix Version/s: 2.7 RC1
    • Component/s: {Unused} Core
    • Labels:
      None
    • keywords:
      entity escape xhtml html utf utf8
    • Difficulty:
      Unknown
    • Similar issues:

      Description

      Please, remove entity encoding if page is in UTF-8. Currently page body looks like this:

      <title>&#1055;&#1077;&#1088;&#1074;&#1099;&#1081; &#1087;&#1088;&#1086;&#1077;&#1082;&#1090; (Sandbox.&#1055;&#1077;&#1088;&#1074;&#1099;&#1081; &#1087;&#1088;&#1086;&#1077;&#1082;&#1090;) - XWiki</title>
      ...
      <meta name="document" content="Sandbox.&#1055;&#1077;&#1088;&#1074;&#1099;&#1081; &#1087;&#1088;&#1086;&#1077;&#1082;&#1090;"/>
      ...
      <meta name="page" content="&#1055;&#1077;&#1088;&#1074;&#1099;&#1081; &#1087;&#1088;&#1086;&#1077;&#1082;&#1090;"/>
      ...
      <span class="tmetype">Page: ...<strong>&#1055;&#1077;&#1088;&#1074;&#1099;&#1081; &#1087;&#1088;&#1086;&#1077;&#1082;&#1090;
      ...
      <div id="xwikicontent">
      <p>&#1055;&#1088;&#1086;&#1073;&#1072; &#1087;&#1077;&#1088;&#1072;....
      

      It makes page size three times bigger, complicates parsing. Also, some other parts of page use plain UTF-8 characters without entity encoding.

        Attachments

          Activity

            People

            • Assignee:
              sdumitriu Sergiu Dumitriu
              Reporter:
              vanav Ivan Vanyushikn
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:
                Date of First Response: