Getting all text nodes in xml using linq or xpath

I am trying to get all text nodes from any XML. Therefore, I need the code to be a bit dynamic.

Listed below are two different XML files that I need to process.

<questionset>

<!-- 1 -->
      <question id="1" returnOnIncorrect="false" vo="" img="">
         <![CDATA[
            ddddddddd?
         ]]>
    <!-- a -->
      <answer id="1" correct="0">
        <input>
          <![CDATA[
              eeeeeeeeee
          ]]>
        </input>
        <output>
          <![CDATA[
              iiiiiiiii.
          ]]>
        </output>
      </answer>
     </question>
</questionset>

Other XML

<xml>
  <content>
    <layout layoutId="0" copyFromId="0">
      <text indent="1" containerId="0">aaaaaaaa</text>
      <sound src="assets/SND_29c.mp3" />
      <img src="assets/IMG_29c.jpg" />
      <text indent="1" containerId="0">bbbbbbb</text>
      <sound src="assets/SND_29d.mp3" />
      <text indent="1" containerId="0">cccccccc</text>
    </layout>
  </content>
</xml>

I was wondering if it is possible to get all text nodes using XPath or Linq.

We tried using XPath with "// text ()", but this does not get text inside the CDATA nodes.

UPDATE PreserveWhitespace is True

Then I tried with Linq to use the following query with no luck:

var xxml = XElement.Load(this.m_folder + "\\" + item.DisplayName);                
var query =
           from e in xxml.Descendants()
           where e.NodeType==XmlNodeType.CDATA
           select e; 

Does anyone know how to solve this? Any answer would be greatly appreciated.

+4
source share
2 answers

LINQ to XML

XDocument doc = XDocument.Load("file.xml");
IEnumerable<XText> textNodes = doc.DescendantNodes().OfType<XText>();
+5

XPath //text(). XML , :

string xmlString = 
    "<questionset>\n" +
    "  <!-- 1 -->\n" +
    "  <question id=\"1\" returnOnIncorrect=\"false\" vo=\"\" img=\"\">\n" +
    "    <![CDATA[\n" +
    "      ddddddddd?\n" +
    "    ]]>\n" +
    "    <!-- a -->\n" +
    "    <answer id=\"1\" correct=\"0\">\n" +
    "      <input>\n" +
    "        <![CDATA[\n" +
    "          eeeeeeeeee\n" +
    "        ]]>\n" +
    "      </input>\n" +
    "      <output>\n" +
    "        <![CDATA[\n" +
    "          iiiiiiiii.\n" +
    "        ]]>\n" +
    "      </output>\n" +
    "    </answer>\n" +
    "  </question>\n" +
    "  <content>\n" +
    "    <layout layoutId=\"0\" copyFromId=\"0\">\n" +
    "      <text indent=\"1\" containerId=\"0\">aaaaaaaa</text>\n" +
    "      <sound src=\"assets/SND_29c.mp3\" />\n" +
    "      <img src=\"assets/IMG_29c.jpg\" />\n" +
    "      <text indent=\"1\" containerId=\"0\">bbbbbbb</text>\n" +
    "      <sound src=\"assets/SND_29d.mp3\" />\n" +
    "      <text indent=\"1\" containerId=\"0\">cccccccc</text>\n" +
    "    </layout>\n" +
    "  </content>\n" +
    "</questionset>";

var doc = new XmlDocument();
doc.LoadXml(xmlString);

var nodes = doc.SelectNodes("//text()");

foreach (XmlNode childrenNode in nodes) {
    Console.WriteLine("<text>" + childrenNode.SelectSingleNode(".").Value.Trim() + "</text>");
} 

:

<text>ddddddddd?</text>
<text>eeeeeeeeee</text>
<text>iiiiiiiii.</text>
<text>aaaaaaaa</text>
<text>bbbbbbb</text>
<text>cccccccc</text>

( Xamarin, Mac OS 10.9)

0

Source: https://habr.com/ru/post/1544515/


All Articles