src/developer/shell/parser/README.md - fuchsia - Git at Google

 # Fuchsia Shell Parser

 Provides a basic parser for the shell grammar.

 The parser is built on combinators, which means the parser is built out of other parsers.

 A parser in the source appears as a function taking a single argument of type ParseResult and
 returning a value of type ParseResult. The argument is the result of parsing a prefix before the
 current parse. As an example, consider the grammar:

   S <- A B

 Assuming we had parser functions for A and B, named A() and B() respectively, we could implement S
 as follows:

   ParseResult S(ParseResult prefix) {
     return B(A(prefix));
   }

 A ParseResult represents a position in the string. If parsing fails completely, the returned
 ParseResult is equal to ParseResult::kEnd, which tests false when used as a boolean. Generally, we
 hope that a failed parse will be recovered, so we will always get a valid parse from our grammar,
 but it may or may not contain error nodes where we had to recover a failure.

 ParseResult contains three pieces of information: the "tail" of the parse, representing the
 un-parsed data remaining after the parse, various information comprising an "error score", which
 indicates how much error recovery was necessary to make the parse succeed, and a stack, which
 contains the AST nodes we have produced.

 To understand the stack, consider our A B example above. After successfully parsing S, the stack
 might contain two nodes:

 Node::A("A") Node::B("B")

 The ParseResult's "node" accessor would yield Node("B"), as it yields the top of the stack. We can
 call Reduce() on a ParseResult to collapse the stack into a single node, so if we called Reduce("S")
 on our result, we would end up with one node on the stack:

 Node::S(children = { Node(A) Node(B) })

 Reduce() will consume the entire stack by default. This is inconvenient for more complicated
 parsers. To make S safe, we would insert a "marker node" to indicate the point in the stack where
 Reduce() should stop consuming.

 ParseResult has methods to automatically push a marker node to each result it yields, or reduce
 every result it yields, so a better implementation of S would be:

   ParseResult S(ParseResult prefix) {
     return B(A(prefix.Mark())).Reduce<Node::S>();
   }

 This pattern is somewhat unwieldy, so we implement several combinator functions to produce it
 automatically. The NT ("NonTerminal") combinator takes a parser as input and automatically places a
 Mark() before it and a Reduce after it, while the Seq combinator runs two parsers in sequence and
 handles the ok flag correctly. With these tools, our S implementation becomes:

   ParseResult S(ParseResult prefix) {
     return NT<Node::S>(Seq(A, B));
   }

 ## Error handling in a nutshell

 TODO: Fill this out when the new error handling mechanism is in place.
	# Fuchsia Shell Parser

	Provides a basic parser for the shell grammar.

	The parser is built on combinators, which means the parser is built out of other parsers.

	A parser in the source appears as a function taking a single argument of type ParseResult and
	returning a value of type ParseResult. The argument is the result of parsing a prefix before the
	current parse. As an example, consider the grammar:

	S <- A B

	Assuming we had parser functions for A and B, named A() and B() respectively, we could implement S
	as follows:

	ParseResult S(ParseResult prefix) {
	return B(A(prefix));
	}

	A ParseResult represents a position in the string. If parsing fails completely, the returned
	ParseResult is equal to ParseResult::kEnd, which tests false when used as a boolean. Generally, we
	hope that a failed parse will be recovered, so we will always get a valid parse from our grammar,
	but it may or may not contain error nodes where we had to recover a failure.

	ParseResult contains three pieces of information: the "tail" of the parse, representing the
	un-parsed data remaining after the parse, various information comprising an "error score", which
	indicates how much error recovery was necessary to make the parse succeed, and a stack, which
	contains the AST nodes we have produced.

	To understand the stack, consider our A B example above. After successfully parsing S, the stack
	might contain two nodes:

	Node::A("A") Node::B("B")

	The ParseResult's "node" accessor would yield Node("B"), as it yields the top of the stack. We can
	call Reduce() on a ParseResult to collapse the stack into a single node, so if we called Reduce("S")
	on our result, we would end up with one node on the stack:

	Node::S(children = { Node(A) Node(B) })

	Reduce() will consume the entire stack by default. This is inconvenient for more complicated
	parsers. To make S safe, we would insert a "marker node" to indicate the point in the stack where
	Reduce() should stop consuming.

	ParseResult has methods to automatically push a marker node to each result it yields, or reduce
	every result it yields, so a better implementation of S would be:

	ParseResult S(ParseResult prefix) {
	return B(A(prefix.Mark())).Reduce<Node::S>();
	}

	This pattern is somewhat unwieldy, so we implement several combinator functions to produce it
	automatically. The NT ("NonTerminal") combinator takes a parser as input and automatically places a
	Mark() before it and a Reduce after it, while the Seq combinator runs two parsers in sequence and
	handles the ok flag correctly. With these tools, our S implementation becomes:

	ParseResult S(ParseResult prefix) {
	return NT<Node::S>(Seq(A, B));
	}

	## Error handling in a nutshell

	TODO: Fill this out when the new error handling mechanism is in place.