VerticalHorizontalParsing

Timestamp:: Jun 20, 2010, 12:48:59 PM (14 years ago)
Author:: Christian Boos
Comment:: #ParsingOverview write down some notes

TracDev/Proposals/VerticalHorizontalParsing

-              v1
+              v2
 In future versions, lists and other kinds of markup could also benefit from this approach.
+== Parsing Overview
+Here's a very rough outline:
+ * `parse_vertical`
+   - prepares !WikiDocument (W)
+   - preprocess (r9868), split text into lines
+   - `parse_blocks` - get a tree of the `{{{` / `}}}` delimited blocks (B) and the spans between them (Raw); at this stage, the root document (W) and each (B) contain a list of (B|Raw) nodes
+   - for each wiki block (i.e. (W) and each (B) containing wiki text)
+     - `parse_raw_text` - each top-level (Raw) node will be scanned for structural ("vertical") markup; for each line:
+       * detect verbatim text ({{{`}}}...{{{`}}} and `{{{`...`}}}` sequences); remember the verbatim spans for that line, then escape them in the line (replaced by 'X') so they cannot match any markup pattern
+       * match vertical patterns which can result in:
+         - (I)tem node   (`- * 1.` etc.)
+         - (D)efinition list node (`... :: ...`)
+         - (Q)uote node (leading space)
+         - (C)itation node (`>+`)
+         - (Row) node (`|| ... ||`)
+         - ...
+         - if nothing matches, this is a plain (T)ext node
+       * at the end, this collection of (S)tructural nodes replaces the (Raw) node
+     - `assemble_nodes` - each node has an indentation level, given by the position of the first non-space character in its starting line; this information will enable us to re-arrange a list of (B) and (S) nodes according to a logical nesting determined by the indentation
+ * `parse_horizontal` - each node in the previous tree will be split further, according to inline ("horizontal") markup
+   - some nodes won't have any text content to process
+   - some will have two pieces of text content (D) or more (Row)
+   - it can well be that some markup will need to be processed recursively (e.g. `[=#anchor ''this was already explained above'']`).
+   - macros could at this stage expand the tree as it's being built (e.g. via a new `IWikiMacroProvider.parse_macro` method)
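
The per-line part of `parse_raw_text` and the indentation-based `assemble_nodes` step outlined above could be sketched roughly as follows. This is only an illustration, not the proposed implementation: the `Node` class, the `classify` and `escape_verbatim` helpers, and all regular expressions are hypothetical and much simpler than the real Trac rules. It also assumes that each verbatim span is replaced by as many 'X' characters as it is long, so that column positions stay valid.

```python
import re
from dataclasses import dataclass, field

# Hypothetical, simplified patterns (assumptions, not Trac's actual rules).
VERBATIM_RE = re.compile(r"\{\{\{.*?\}\}\}|`[^`]*`")
VERTICAL_PATTERNS = [
    ("Row", re.compile(r"\|\|.*\|\|\s*$")),     # || ... ||
    ("C", re.compile(r">+")),                    # citation
    ("I", re.compile(r"(?:[-*]|\d+\.)\s")),      # list item
    ("D", re.compile(r".*\S\s*::(?:\s|$)")),     # term:: definition
]

@dataclass
class Node:
    kind: str                 # "I", "D", "Q", "C", "Row", "T" or "W" (root)
    indent: int               # column of the first non-space character
    text: str                 # original, unescaped line
    verbatim: list = field(default_factory=list)   # (start, end) spans
    children: list = field(default_factory=list)

def escape_verbatim(line):
    """Return the line with verbatim spans replaced by 'X', plus the spans."""
    spans = [m.span() for m in VERBATIM_RE.finditer(line)]
    escaped = VERBATIM_RE.sub(lambda m: "X" * len(m.group()), line)
    return escaped, spans

def classify(line):
    """Match one line against the vertical patterns and build its node."""
    escaped, spans = escape_verbatim(line)
    stripped = escaped.lstrip(" ")
    indent = len(escaped) - len(stripped)
    for kind, pattern in VERTICAL_PATTERNS:
        if pattern.match(stripped):
            return Node(kind, indent, line, spans)
    # Nothing matched: indented text is a quote, the rest is plain text.
    return Node("Q" if indent else "T", indent, line, spans)

def assemble_nodes(nodes):
    """Nest a flat list of nodes according to their indentation."""
    root = Node("W", -1, "")
    stack = [root]
    for node in nodes:
        # Close every open node that is not strictly less indented.
        while stack[-1].indent >= node.indent:
            stack.pop()
        stack[-1].children.append(node)
        stack.append(node)
    return root

lines = [
    "Intro text",
    " * item with `|| not a row ||`",
    "   * nested item",
    " term:: definition",
    "|| a || b ||",
    "> cited",
]
tree = assemble_nodes([classify(line) for line in lines])
```

Because the escaped copy of the line is used only for matching, the `||` inside the backticks of the second line does not turn that line into a (Row) node, while the remembered spans still point at the original verbatim text for the later horizontal pass.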