Light-weight, simple and fast XML parser for C++ with XPath support

Find a file

Arseny Kapoulkine 5f996eba6d Do not emit surrounding whitespace for text nodes Previously we omitted extra whitespace for single PCDATA/CDATA children, but in mixed content there was extra indentation before/after text nodes. One of the problems with that is that the text that you saved is not exactly the same as the parsing result using default flags (parse_trim_pcdata helps). Another problem is that parse-format cycles do not have a fixed point for mixed content - the result expands indefinitely. Some XML libraries, like Python minidom, have the same issue, but this is definitely a problem. Pretty-printing mixed content is hard. It seems that the only other sensible choice is to switch mixed content nodes to raw formatting. In a way the code in this change is a weaker version of that - it removes indentation around text nodes but still keeps it around element siblings/children. Thus we can switch to mixed-raw formatting at some point later, which will be a superset of the current behavior. To do this we have to either switch at the first text node (.NET XmlDocument does that), or scan the children of each element for a possible text node and switch before we output the first child. The former behavior seems non-intuitive (and a bit broken); unfortunately, the latter behavior can cost up to 20% of the output time for trees without mixed content. Fixes #13.		2015-03-18 09:59:17 -07:00
contrib	contrib: Fix foreach.hpp for Boost 1.56.0	2014-09-27 04:49:27 +00:00
docs	docs: Change issue links to GitHub	2015-01-24 08:20:42 -08:00
scripts	Update version to 1.5	2014-11-17 21:47:37 -08:00
src	Do not emit surrounding whitespace for text nodes	2015-03-18 09:59:17 -07:00
tests	Do not emit surrounding whitespace for text nodes	2015-03-18 09:59:17 -07:00
.travis.yml	Revert "Change Travis config to build on Linux/OSX"	2014-11-06 16:40:35 +01:00
Jamfile.jam	Update version to 1.5	2014-11-17 21:47:37 -08:00
Jamrules.jam	docs: Regenerated documentation	2014-11-18 09:30:19 -08:00
Makefile	tests: Add support for afl-fuzz	2015-03-13 00:18:30 -07:00
README.md	Update version number in README.md	2014-11-27 00:12:42 -08:00
readme.txt	Update version to 1.5	2014-11-17 21:47:37 -08:00

README.md

pugixml

pugixml is a C++ XML processing library, which consists of a DOM-like interface with rich traversal/modification capabilities, an extremely fast XML parser which constructs the DOM tree from an XML file/buffer, and an XPath 1.0 implementation for complex data-driven tree queries. Full Unicode support is also available, with Unicode interface variants and conversions between different Unicode encodings (which happen automatically during parsing/saving).

pugixml is used by a lot of projects, both open-source and proprietary, for performance and easy-to-use interface.

Documentation

Documentation for the current release of pugixml is available on-line as two separate documents:

Quick-start guide, that aims to provide enough information to start using the library;
Complete reference manual, that describes all features of the library in detail.

You’re advised to start with the quick-start guide; however, many important library features are either not described in it at all or only mentioned briefly; if you require more information you should read the complete manual.

License

This library is available to anybody free of charge, under the terms of MIT License:

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

README.md Unescape Escape

pugixml

Documentation

License

README.md