-
Notifications
You must be signed in to change notification settings - Fork 0
Abstract Pattern Tree
The Abstract Pattern Tree (APT) is a graph-based description of the algorithmic structure of source code. In general, we can generate an applicable APT from a given Abstract Syntax Tree (AST), if we have appropriate knowledge on the on the source language. In this case we utilized the Monticore framework to create a Domain Specific Language (DSL) for creating a prototype with sufficient knowledge on the source code. Furthermore, Monticore generates the AST for sources in our language. Since, we aim to utilize the APT as an Intermediate Representation (IR) the implementation contains a lot more nodes, than necessary for the optimization of the parallel patterns. In the following, the different Pattern-Nodes will be introduced. For traversing the APT a visitor can be used. The definition of data elements can be found here.
The generation of the APT based on the AST is explained here.
The most basic node within an APT is the Pattern Node, which will be extended by all following nodes.
Abstract definition of function declarations, will be extended by the function nodes. Function nodes do not have a parent node.
Abstract definition of parallel function definitions, will be extended by the individual parallel nodes.
The map node describes the (parallel) map pattern and extends the parallel node. This node contains information on the computation of a single data element from a one dimensional input/output.
The reduce node describes the (parallel) reduction pattern and extends the parallel node. This node contains information on the computation of a single data element from a one dimensional input as well as the recombination function, defining the reduction step.
The stencil node describes the (parallel) stencil pattern and extends the parallel node. This node contains information on the computation of a single data element from a multidimensional input/output
The dynamic progamming node defines the (parallel) dynamic programming pattern. This node contains information on the computation of a single data element for any given timestep.
The serial node describes function definition without explicit parallelism, extends the function node.
The main node is a serial node describing the programm entry point.
Other than function nodes there are also pattern nodes, which describe controll statement and actions. Those nodes directly inherit the parameters from pattern node.
The branch node is a pattern node that signals a branching behaviour at this step. The child nodes it contains are all branch case nodes
The branch case node describes an idividual path in the current branch. If the parameter hasCondition is true, the first element in the children list is the conditional expression for this path. A value of false in the hasCondition parameter means that the path is chosen, if all others are not selected.
The call node defines a function call within an expression. This node acts as an indirect conection between the call and the function definition. The arguments for the call are defined in the parent expression.
The parallel call node defines the call of a parallel pattern. It extends the call node, therefore, building the connection between the call and the pattern definition via the identifier. The first childnodes are always the additional arguments. The number of these additional arguments is defined in additionalArgumentCount. The following child defines the assignment from the pattern-call to the resulting variable.
The individual values within the additional arguments are stored in either a MetaValue or a MetaList and currently are only of type Integer. The corresponding value list can be accessed with getValue() or getValues(). Different parallel patterns contain differnt additional arguments:
-
Map:
- The first additional argument of the map pattern is a MetaValue containing the width(number of instances) of the pattern
- The second additional argument contains a MetaValue containing the start offset(the first value for index) of a map.
-
Reduction:
- The first additional argument of the reduction pattern is a MetaList containing the following values in the given order:
- width of the reduction
- arity of the combiner function
- depth of the reduction
- start offset of the reduction
- The first additional argument of the reduction pattern is a MetaList containing the following values in the given order:
-
Dynamic Programming:
- The first additional argument of the dynamic programming pattern is a MetaValue containing the number of timesteps
- The second argument is a MetaValue containing the width of the "Map" within the dynamic programming recursion.
- The third argument is a MetaList containing the start offsets for the timestep and map iterations.
-
Stencil:
- The first additional argument of the stencil pattern is a MetaList containing the widths over each dimension starting with the first INDEX variable in accordance to the language.
- The second additional argument is a MetaList containing the start offsets for each dimension in the same order as the width.
Loop nodes contain the abstract definition for 3 types of loop statements: for-loops, for-each-loops and while-loops.
The for-loop node describes the common for loop, where the child nodes are executed for each iteration. For this node the first 3 child nodes are dedicated to controlling the loop.
- The initialization of the loop control-variable.
- The loop-condition, to test whether the loop-variable is still within the defined range.
- The update rule for the loop-variable, after every iteration.
The for-each-loop node describes the for loop which iterates over all elements of a given arry. For this node the first child node is dedicated to the expression defining the array.
The while-loop node describes the while loop which executes the given child nodes as long a given expression is evaluated to true. For this node the first child node is dedicated to the run condition (expression).
Complex expression nodes are nodes containing a single expression. This expression can potentially contain function calls. If the expression contains function calls they are replaced by function-inline-data elements, which are variables representing a single function call.
The simple expression block node is a leaf node containing a set of expressions without any function calls.
The return node defines the function exit point. The only child node is the expression defining the result of the function.