OpenXML Filter: improve Word style optimisation for properties reiterated at the direct formatting level

Please consider the following case. A paragraph contains two runs. The second run reiterates the rFonts property, although the same property is already provided by the default paragraph style, which applies to that run at the paragraph level.

<w:p w:rsidR="00D577F2" w:rsidRPr="0076395F" w:rsidRDefault="0076395F">
    <w:pPr>
        <w:rPr>
            <w:lang w:val="en-US"/>
        </w:rPr>
    </w:pPr>
    <w:r w:rsidRPr="00D00CAC">
        <w:rPr>
            <w:sz w:val="24"/>
            <w:szCs w:val="24"/>
            <w:lang w:val="en-US"/>
        </w:rPr>
        <w:t>Run 1.</w:t>
    </w:r>
    <w:r w:rsidR="004629C4" w:rsidRPr="00D00CAC">
        <w:rPr>
            <w:rFonts w:ascii="Arial" w:hAnsi="Arial" w:cs="Arial"/>
            <w:lang w:val="en-US"/>
        </w:rPr>
        <w:t>Run 2.</w:t>
    </w:r>
</w:p>

The default paragraph style is defined as:

<w:style w:type="paragraph" w:default="1" w:styleId="Normal">
    <w:name w:val="Normal"/>
    <w:qFormat/>
    <w:rPr>
        <w:rFonts w:ascii="Arial" w:hAnsi="Arial" w:cs="Arial"/>
    </w:rPr>
</w:style>

Currently, this is extracted as:

<trans-unit id="NFDBB2FA9-tu1" xml:space="preserve">
    <source xml:lang="en"><g id="1">Run 1.</g><g id="2">Run 2.</g></source>
    <target xml:lang="fr"><g id="1">Run 1.</g><g id="2">Run 2.</g></target>
</trans-unit>

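Since the rFonts value of the second run only repeats what the default paragraph style already provides, that run has no formatting of its own that differs from the paragraph. It would be nice to have the following extraction: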

<trans-unit id="NFDBB2FA9-tu1" xml:space="preserve">
    <source xml:lang="en"><g id="1">Run 1.</g>Run 2.</source>
    <target xml:lang="fr"><g id="1">Run 1.</g>Run 2.</target>
</trans-unit>
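
The requested check could be sketched roughly as below. This is only an illustration in Python, not the filter's actual code: the helper names (`props`, `effective_run_props`, `local`) are hypothetical, and treating `w:lang` as non-visible formatting is an assumption made for the example. A directly applied run property is dropped when the default paragraph style already supplies exactly the same value.

```python
import xml.etree.ElementTree as ET

W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"

# Assumption for this sketch: language tags do not count as visible formatting.
IGNORED = {f"{{{W}}}lang"}


def props(rpr):
    """Map each property element's tag to its attributes."""
    if rpr is None:
        return {}
    return {child.tag: dict(child.attrib) for child in rpr}


def effective_run_props(run, style):
    """Return the run properties that the style does not already supply."""
    inherited = props(style.find(f"{{{W}}}rPr"))
    direct = props(run.find(f"{{{W}}}rPr"))
    return {
        tag: attrs
        for tag, attrs in direct.items()
        if tag not in IGNORED and inherited.get(tag) != attrs
    }


def local(tag):
    """Strip the namespace part from a '{namespace}name' tag."""
    return tag.split("}")[1]


STYLE = ET.fromstring(f'''<w:style xmlns:w="{W}" w:type="paragraph" w:default="1" w:styleId="Normal">
  <w:rPr><w:rFonts w:ascii="Arial" w:hAnsi="Arial" w:cs="Arial"/></w:rPr>
</w:style>''')

RUN_1 = ET.fromstring(f'''<w:r xmlns:w="{W}">
  <w:rPr><w:sz w:val="24"/><w:szCs w:val="24"/><w:lang w:val="en-US"/></w:rPr>
  <w:t>Run 1.</w:t>
</w:r>''')

RUN_2 = ET.fromstring(f'''<w:r xmlns:w="{W}">
  <w:rPr><w:rFonts w:ascii="Arial" w:hAnsi="Arial" w:cs="Arial"/><w:lang w:val="en-US"/></w:rPr>
  <w:t>Run 2.</w:t>
</w:r>''')

# Run 1 keeps its size properties, so it still needs an inline code.
print(sorted(local(t) for t in effective_run_props(RUN_1, STYLE)))
# Run 2 has nothing left, so it could be extracted as plain text.
print(effective_run_props(RUN_2, STYLE))
```

The comparison is deliberately strict: a property is treated as redundant only when all of its attributes match the style, so a run that overrides even one font attribute would keep its inline code.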

For more information please refer to the attached document.
