accelforge.frontend package#
Subpackages#
- accelforge.frontend.mapper package
- Submodules
- accelforge.frontend.mapper.ffm module
FFMFFM.force_memory_hierarchy_orderFFM.info_metricsFFM.max_fused_loopsFFM.max_fused_loops_per_rank_variableFFM.max_loopsFFM.max_loops_minus_ranksFFM.max_pmapping_templates_per_einsumFFM.memory_limitFFM.memory_limit_per_processFFM.metricsFFM.out_of_order_hierarchy_explore_removing_spatials_for_more_temporalsFFM.time_limitFFM.time_limit_per_pmapping_template
- accelforge.frontend.mapper.mapper module
- accelforge.frontend.mapper.metrics module
- Module contents
FFMFFM.force_memory_hierarchy_orderFFM.info_metricsFFM.max_fused_loopsFFM.max_fused_loops_per_rank_variableFFM.max_loopsFFM.max_loops_minus_ranksFFM.max_pmapping_templates_per_einsumFFM.memory_limitFFM.memory_limit_per_processFFM.metricsFFM.out_of_order_hierarchy_explore_removing_spatials_for_more_temporalsFFM.time_limitFFM.time_limit_per_pmapping_template
MapperMetrics
- accelforge.frontend.mapping package
- Submodules
- accelforge.frontend.mapping.mapping module
- Module contents
Submodules#
accelforge.frontend.arch module#
- class accelforge.frontend.arch.Action[source]#
Bases:
EvalableModelAn action that may be performed by a component.
- energy: EvalsTo[int | float | None]#
Dynamic energy of this action. Per-action energy is multiplied by the component’s energy_scale and the action’s energy_scale.
- energy_scale: EvalsTo[int | float]#
The scale factor for dynamic energy of this action. Multiplies this action’s energy by this value.
- extra_attributes_for_component_model: EvalExtras#
Extra attributes to pass to the component model. In addition to all attributes of this action, any extra attributes will be passed to the component model as arguments to the component model’s action. This can be used to define attributes that are known to the component model, but not accelforge, such as clock frequency.
- latency: EvalsTo[int | float | None]#
Latency of this action. Per-action latency is multiplied by the component’s latency_scale and the action’s latency_scale.
- class accelforge.frontend.arch.Arch[source]#
Bases:
HierarchicalTop-level architecture specification.
All attributes in the architecture can refrence variables in the spec-level variables field as well as symbols from the individual Einsum being processed.
- extra_attributes_for_all_component_models: EvalExtras#
Extra attributes to pass to all component models. This can be used to pass global attributes, such as technology node or clock period, to every component model.
- property per_component_total_area: dict[str, float]#
Returns the total area used by each component in the architecture in m^2.
- property per_component_total_leak_power: dict[str, float]#
Returns the total leak power of each component in the architecture in W.
- property total_area: float#
Returns the total area of the architecture in m^2.
- Returns:
The total area of the architecture in m^2.
- Return type:
- property total_leak_power: float#
Returns the total leak power of the architecture in W.
- Returns:
The total leak power of the architecture in W.
- Return type:
- variables: EvalExtras#
Like the spec-level variables field, this field is evaluated first and its contents can be referenced elsewhere in the architecture. Unlike the spec-level variables field, this, like ther rest of the architecture, is evaluated per-Einsum and can reference Einsum-specific symbols.
- class accelforge.frontend.arch.ArchNode[source]#
Bases:
EvalableModelA node in the architecture.
- find(name)[source]#
Finds a Leaf node with the given name.
- Raises:
ValueError – If the Leaf node with the given name is not found.
- Return type:
- class accelforge.frontend.arch.Branch[source]#
Bases:
ArchNode- __init__(*args, **kwargs)#
Create a new model by parsing and validating input data from keyword arguments.
Raises [ValidationError][pydantic_core.ValidationError] if the input data cannot be validated to form a valid model.
self is explicitly positional-only to allow self as a field name.
- nodes: ArchNodes[Annotated[Annotated[Compute, Tag(tag=Compute)] | Annotated[Memory, Tag(tag=Memory)] | Annotated[Toll, Tag(tag=Toll)] | Annotated[Fanout, Tag(tag=Fanout)] | Annotated[_Parallel, Tag(tag=_Parallel)] | Annotated[Hierarchical, Tag(tag=Hierarchical)] | Annotated[Fork, Tag(tag=Fork)], Discriminator(discriminator=_get_tag, custom_error_type=None, custom_error_message=None, custom_error_context=None)]]#
- class accelforge.frontend.arch.Comparison[source]#
Bases:
EvalableModelA comparison between a rank variable’s bound and a value. A comparison is performed for each rank variable.
The LHS of each comparison is the loop bound of a loop that affects this rank variable. The RHS is the given value.
For example, if the expression resolves to [a, b], the operator is “<=”, and the value is 10, and we have loops “for a0 in [0..A0)” and “for b0 in [0..B0)”, then a mapping is only valid if A0 <= 10 and B0 <= 10.
- expression: TryEvalTo[InvertibleSet[str]]#
The expression to compare. This expression should resolve to a set of rank variables. A comparison is performed for each rank variable independently, and the result passes if and only if all comparisons pass. The LHS of each comparison is the loop bound of a loop that affects this rank variable. The RHS is the given value.
- operator: str#
The operator to use for the comparison. Supported operators are: - == (equal to) - <= (less than or equal to) - >= (greater than or equal to) - < (less than) - > (greater than) - product== (product of all loop bounds is equal to) - product<= (product of all loop bounds is less than or equal to) - product>= (product of all loop bounds is greater than or equal to) - product< (product of all loop bounds is less than) - product> (product of all loop bounds is greater than)
- class accelforge.frontend.arch.Component[source]#
Bases:
LeafA component object in the architecture. This is overridden by different component types, such as Memory and Compute.
- __init__(*args, **kwargs)#
Create a new model by parsing and validating input data from keyword arguments.
Raises [ValidationError][pydantic_core.ValidationError] if the input data cannot be validated to form a valid model.
self is explicitly positional-only to allow self as a field name.
- area: EvalsTo[int | float | None]#
The area of a single instance of this component in m^2. If set, area calculations will use this value.
- area_scale: EvalsTo[int | float]#
The scale factor for the area of this comxponent. This is used to scale the area of this component. For example, if the area is 1 m^2 and the scale factor is 2, then the area is 2 m^2.
- calculate_action_energy(component_models=None, in_place=False)[source]#
Calculates energy for each action of this component. If energy is set in the action or component (with action taking precedence), that value will be used. Otherwise, the energy will be calculated using hwcomponents. Populates, for each action, the
<action>.energyand field. Extends thecomponent_modeling_logfield with log messages.Uses the
component_modelattribute, or, if not set, thecomponent_classattribute to find the model and populate thecomponent_modelattribute.Note that these methods will be called by the Spec when calculating energy and area. If you call them yourself, note that string expressions may not be evaluated because they need the Spec’s global scope. If you are sure that all necessary values are present and not a result of an expression, you can call these directly. Otherwise, you can call the
Spec.calculate_component_area_energy_latency_leakand then grab components from the returnedSpec.- Parameters:
component_models (list[hwcomponents.ComponentModel] | None) – The models to use for energy calculation. If not provided, the models will be found with hwcomponents.get_models().
in_place (bool) – If True, the component will be modified in place. Otherwise, a copy will be returned.
- Returns:
A copy of the component with the calculated energy.
- Return type:
- calculate_action_latency(component_models=None, in_place=False)[source]#
Calculates the latency for each action by this component. Populates the
<action>.latencyfield. Extends thecomponent_modeling_logfield with log messages.- Parameters:
component_models (list[hwcomponents.ComponentModel] | None) – The models to use for latency calculation. If not provided, the models will be found with hwcomponents.get_models().
in_place (bool) – If True, the component will be modified in place. Otherwise, a copy will be returned.
- Returns:
A copy of the component with the calculated latency for each action.
- Return type:
Self
- calculate_area(component_models=None, in_place=False)[source]#
Calculates the area for this component. If area is set in the component, that value will be used. Otherwise, the area will be calculated using the hwcomponents library. Populates
areafield. Extends thecomponent_modeling_logfield with log messages.Uses the
component_modelattribute, or, if not set, thecomponent_classattribute to find the model and populate thecomponent_modelattribute.Note that these methods will be called by the Spec when calculating energy and area. If you call them yourself, note that string expressions may not be evaluated because they need the Spec’s global scope. If you are sure that all necessary values are present and not a result of an expression, you can call these directly. Otherwise, you can call the
Spec.calculate_component_area_energy_latency_leakand then grab components from the returnedSpec.- Parameters:
component_models (list[hwcomponents.ComponentModel] | None) – The models to use for area calculation. If not provided, the models will be found with hwcomponents.get_models().
in_place (bool) – If True, the component will be modified in place. Otherwise, a copy will be returned.
- Returns:
A copy of the component with the calculated area.
- Return type:
Self
- calculate_area_energy_latency_leak(component_models=None, in_place=False, _use_cache=False)[source]#
Calculates the area, energy, latency, and leak power for this component. Populates the
area,total_area,leak_power,total_leak_power,total_latency, andcomponent_modeling_logfields of this component. Additionally, for each action, populates the<action>.area,<action>.energy,<action>.latency, and<action>.leak_powerfields. Extends thecomponent_modeling_logfield with log messages.Note that these methods will be called by the Spec when calculating energy and area. If you call them yourself, note that string expressions may not be evaluated because they need the Spec’s global scope. If you are sure that all necessary values are present and not a result of an expression, you can call these directly. Otherwise, you can call the
Spec.calculate_component_area_energy_latency_leakand then grab components from the returnedSpec.- Parameters:
component_models (list[hwcomponents.ComponentModel] | None) – The models to use for energy calculation. If not provided, the models will be found with hwcomponents.get_models().
in_place (bool) – If True, the component will be modified in place. Otherwise, a copy will be returned.
_use_cache (bool) – If True, the component model will be cached and reused if the same component class, attributes, and actions are provided. Note that this may return copies of the same object across multiple calls.
- Returns:
The component with the calculated energy, area, and leak power.
- Return type:
Self
- calculate_leak_power(component_models=None, in_place=False)[source]#
Calculates the leak power for this component. If leak power is set in the component, that value will be used. Otherwise, the leak power will be calculated using hwcomponents. Populates
leak_powerfield. Extends thecomponent_modeling_logfield with log messages.Uses the
component_modelattribute, or, if not set, thecomponent_classattribute to find the model and populate thecomponent_modelattribute.Note that these methods will be called by the Spec when calculating energy and area. If you call them yourself, note that string expressions may not be evaluated because they need the Spec’s global scope. If you are sure that all necessary values are present and not a result of an expression, you can call these directly. Otherwise, you can call the
Spec.calculate_component_area_energy_latency_leakand then grab components from the returnedSpec.- Parameters:
component_models (list[hwcomponents.ComponentModel] | None) – The models to use for energy calculation. If not provided, the models will be found with hwcomponents.get_models().
in_place (bool) – If True, the component will be modified in place. Otherwise, a copy will be returned.
- Returns:
A copy of the component with the calculated energy.
- Return type:
Self
- component_class: str | None#
The class of this Component. Used if an energy or area model needs to be called for this Component.
- component_model: ComponentModel | None#
The model to use for this Component. If not set, the model will be found with hwcomponents.get_models(). If set, the component_class will be ignored.
- enabled: TryEvalTo[bool]#
Whether this component is enabled. If the expression resolves to False, then the component is disabled. This is evaluated per-pmapping-template, so it is a function of the tensors in the current Einsum. For example, you may say len(All) >= 3 and the component will only be enabled with Einsums with three or more tensors.
- energy_scale: EvalsTo[int | float]#
The scale factor for dynamic energy of this component. For each action, multiplies this action’s energy. Multiplies the calculated energy of each action.
- extra_attributes_for_component_model: _ExtraAttrs#
Extra attributes to pass to the component model. In addition to all attributes of this component, any extra attributes will be passed to the component model. This can be used to define attributes that are known to the component model, but not accelforge, such as the technology node.
- get_component_class(trying_to_calculate=None)[source]#
Returns the class of this Component.
- Parameters:
trying_toeval (str, optional) – What was trying to be calculated using this component. If provided, the error message will be more specific.
- Raises:
EvaluationError – If the component_class is not set.
- Return type:
- latency_scale: EvalsTo[int | float]#
The scale factor for the latency of this component. This is used to scale the latency of this component. For example, if the latency is 1 ns and the scale factor is 2, then the latency is 2 ns. Multiplies the calculated latency of each action.
- leak_power: EvalsTo[int | float | None]#
The leak power of a single instance of this component in W. If set, leak power calculations will use this value.
- leak_power_scale: EvalsTo[int | float]#
The scale factor for the leak power of this component. This is used to scale the leak power of this component. For example, if the leak power is 1 W and the scale factor is 2, then the leak power is 2 W.
- n_parallel_instances: EvalsTo[int | float]#
The number of parallel instances of this component. Increasing parallel instances will proportionally increase area and leakage, while reducing latency (unless latency calculation is overridden).
- populate_component_model(component_models=None, in_place=False, trying_to_calculate=None)[source]#
Populates the
component_modelattribute with the model for this component. Extends thecomponent_modeling_logfield with log messages. Uses thecomponent_classattribute to find the model and populate thecomponent_modelattribute. Uses thehwcomponents.get_model()function to find the model.- Parameters:
component_models (list[hwcomponents.ComponentModel] | None) – The models to use for energy calculation. If not provided, the models will be found with hwcomponents.get_models().
in_place (bool) – If True, the component will be modified in place. Otherwise, a copy will be returned.
trying_to_calculate (str, optional) – What was trying to be calculated using this component. If provided, the error messages for missing component_class will be more specific.
- Returns:
A copy of the component with the populated
component_modelattribute.- Return type:
- total_area: EvalsTo[int | float | None]#
The total area of all instances of this component in m^2. Do not set this value. It is calculated when the architecture’s area is calculated.
- total_latency: str | int | float#
An expression representing the total latency of this component in seconds. This is used to calculate the latency of a given Einsum. Special variables available are the following:
min: The minimum value of all arguments to the expression.
max: The maximum value of all arguments to the expression.
sum: The sum of all arguments to the expression.
X_actions: The number of times action X is performed. For example, read_actions is the number of times the read action is performed.
X_latency: The total latency of all actions of type X. For example, read_latency is the total latency of all read actions. It is equal to the per-read latency multiplied by the number of read actions.
action2latency: A dictionary of action names to their latency.
Additionally, all component attributes are availble as variables, and all other functions generally available in parsing. Note this expression is evaluated after other component attributes are evaluated.
For example, the following expression calculates latency assuming that each read or write action takes 1ns:
1e-9 * (read_actions + write_actions).
- class accelforge.frontend.arch.Fanout[source]#
Bases:
LeafCreates a spatial fanout, and doesn’t do anything else.
- class accelforge.frontend.arch.Fork[source]#
Bases:
HierarchicalA Fork is a Hierarchical that branches off from the main path. The nodes inside the Fork are a separate branch, while the main path continues to the next sibling after the Fork.
- class accelforge.frontend.arch.Leaf[source]#
Bases:
ArchNodeA leaf node in the architecture. This is an abstract class that represents any node that is not a Branch.
- __init__(*args, **kwargs)#
Create a new model by parsing and validating input data from keyword arguments.
Raises [ValidationError][pydantic_core.ValidationError] if the input data cannot be validated to form a valid model.
self is explicitly positional-only to allow self as a field name.
- spatial: EvalableList[Spatial]#
The spatial fanouts of this Leaf.
Spatial fanouts describe the spatial organization of components in the architecture. A spatial fanout of size N for this node means that there are N instances of this node. Multiple spatial fanouts lead to a multi-dimensional fanout. Spatial constraints apply to the data exchange across these instances. Spatial fanouts specified at this level also apply to lower-level Leaf nodes in the architecture.
- class accelforge.frontend.arch.Memory[source]#
Bases:
TensorHolderA Memory is a TensorHolder that stores data over time, allowing for temporal reuse.
- actions: EvalableList[TensorHolderAction]#
The actions that this Memory can perform.
- class accelforge.frontend.arch.Spatial[source]#
Bases:
EvalableModelA one-dimensional spatial fanout in the architecture.
- loop_bounds: EvalableList[Comparison]#
Bounds for loops over this dimension. This is a list of
Comparisonobjects, all of which must be satisfied by the loops to which this constraint applies.Note: Loops may be removed if they are constrained to only one iteration.
- may_reuse: TryEvalTo[InvertibleSet[str]]#
The tensors that can be reused spatially across instances of this fanout. This expression will be evaluated for each mapping template.
- min_usage: int | float | str#
The minimum usage of spatial instances, as a value from 0 to 1. A mapping is invalid if less than this porportion of this dimension’s fanout is utilized. Mappers that support it (e.g., FFM) may, if no mappings satisfy this constraint, return the highest-usage mappings.
- power_gateable: EvalsTo[bool]#
Whether this spatial fanout has power gating. If True, then unused spatial instances will be power gated if not used by a particular Einsum.
- reuse: TryEvalTo[InvertibleSet[str]]#
A set of tensors or a set expression representing tensors that must be reused across spatial iterations. Spatial loops may only be placed that reuse ALL tensors given here.
Note: Loops may be removed if they do not reuse a tensor given here and they do not appear in another loop bound constraint.
- class accelforge.frontend.arch.TensorHolder[source]#
Bases:
ComponentA TensorHolder is a component that holds tensors. These are usually Memories, but can also be Tolls.
- __init__(*args, **kwargs)#
Create a new model by parsing and validating input data from keyword arguments.
Raises [ValidationError][pydantic_core.ValidationError] if the input data cannot be validated to form a valid model.
self is explicitly positional-only to allow self as a field name.
- actions: EvalableList[TensorHolderAction]#
The actions that this TensorHolder can perform.
- bits_per_action: EvalsTo[int | float | None]#
The number of bits accessed in each of this component’s actions. Overridden by bits_per_action in any action of this component. If set here, acts as a default value for the bits_per_action of all actions of this component.
- class accelforge.frontend.arch.Tensors[source]#
Bases:
EvalableModelFields that control which tensor(s) are kept in a
TensorHolderand in what order their nodes may appear in the mapping.- back: TryEvalTo[InvertibleSet[str]]#
A set expression describing which tensors must be backed by this
accelforge.frontend.arch.TensorHolder. If this is not defined, then no tensors must be backed.
- force_memory_hierarchy_order: bool#
If set to true, storage nodes for lower-level memories must be placed below storage nodes for higher-level memories. For example, all MainMemory storage nodes must go above all LocalBuffer storage nodes.
This constraint always applies to same-tensor storage nodes (e.g., MainMemory reusing Output must go above LocalBuffer reusing Output); turning it off will permit things like MainMemory reusing Output going above LocalBuffer reusing Input.
This is identical to the force_memory_hierarchy_order field in the FFM class, but only applies to this tensor holder.
- keep: TryEvalTo[InvertibleSet[str]]#
A set expression describing which tensors must be kept in this
accelforge.frontend.arch.TensorHolder. If this is not defined, then all tensors must be kept. Any tensors that are inbackwill also be added tokeep.
- may_keep: TryEvalTo[InvertibleSet[str]]#
A set expression describing which tensors may optionally be kept in this
accelforge.frontend.arch.TensorHolder. The mapper will explore both keeping and not keeping each of these tensors. If this is not defined, then all tensors may be kept.
- no_refetch_from_above: TryEvalTo[InvertibleSet[str]]#
The tensors that are not allowed to be refetched from above. This is given as a set of
TensorNameobjects or a set expression that resolves to them. These tensors must be fetched at most one time from above memories, and may not be refetched across any temporal or spatial loop iterations. Tensors may be fetched in pieces (if they do not cause re-fetches of any piece).
- tensor_order_options: EvalableList[EvalableList[TryEvalTo[InvertibleSet[str]]]]#
Options for the order of tensor storage nodes in the mapping. This is given as a list-of-lists-of-sets. Each list-of-sets is a valid order of tensor storage nodes. Order is given from highest in the mapping to lowest.
For example, an option could be [input | output, weight], which means that there is no relative ordering required between input and output, but weight must be below both.
- tile_shape: EvalableList[Comparison]#
The tile shape for each rank variable. This is given as a list of
Comparisonobjects, where each comparison must evaluate to True for a valid mapping.
- class accelforge.frontend.arch.Toll[source]#
Bases:
TensorHolderA Toll is a TensorHolder that does not store data over time, and therefore does not allow for temporal reuse. Use this as a toll that charges reads and writes every time a piece of data moves through it.
Every write to a Toll is immediately written to the next Memory (which may be above or below depending on where the write came from), and same for reads.
The access counts of a Toll are only included in the “read” action. Each traversal through the Toll is counted as a read. Writes are always zero.
- actions: EvalableList[TensorHolderAction]#
The actions that this Toll can perform.
accelforge.frontend.config module#
- class accelforge.frontend.config.Config[source]#
Bases:
EvalableModel- component_models: EvalableList[str | ComponentModel]#
A list of hwcomponents models to use for the energy and area calculations. These can either be paths to Python files that contain the models, or hwcomponents
ComponentModelobjects.
- expression_custom_functions: EvalableList[str | Callable]#
A list of functions to use while parsing expressions. These can either be functions or paths to Python files that contain the functions. If a path is provided, then all functions in the file will be added to the evaluator.
- classmethod from_yaml(f)[source]#
Loads a dictionary from one more more yaml files.
Each yaml file should contain a dictionary. Dictionaries are combined in the order they are given.
Keyword arguments are also added to the dictionary.
- Parameters:
files – A list of yaml files to load.
jinja_parse_data – Optional[Dict[str, Any]] A dictionary of Jinja2 data to use when parsing the yaml files.
top_key – Optional[str] The top key to use when parsing the yaml files.
kwargs – Extra keyword arguments to be passed to the constructor.
- Return type:
- Returns:
A dict containing the combined dictionaries.
accelforge.frontend.model module#
accelforge.frontend.renames module#
- class accelforge.frontend.renames.EinsumRename[source]#
Bases:
EvalableModelRenames for a single Einsum.
- __init__(*args, **kwargs)[source]#
Create a new model by parsing and validating input data from keyword arguments.
Raises [ValidationError][pydantic_core.ValidationError] if the input data cannot be validated to form a valid model.
self is explicitly positional-only to allow self as a field name.
- name: str#
The name of the Einsum. Set this to “default” to apply the renames to all Einsums, unless overridden. Overriding is specific to a single name, so every rename in the default must be overridden independently.
- rank_variables: EvalableList[Rename]#
Renames for the rank variables of this Einsum. This may be given either as a dictionary
{new_name: source_set_expression}expressions, or as a list of dictionaries, each one having the structure{name: new_name, source: source_set_expression, expected_count: 1}, where expected count is optional for each and may be set to any integer.
- tensor_accesses: EvalableList[Rename]#
Renames for the tensor accesses of this Einsum. This may be given either as a dictionary
{new_name: source_set_expression}expressions, or as a list of dictionaries, each one having the structure{name: new_name, source: source_set_expression, expected_count: 1}, where expected count is optional for each and may be set to any integer.
- class accelforge.frontend.renames.Rename[source]#
Bases:
EvalableModelA rename of something into something else.
- class accelforge.frontend.renames.RenameList[source]#
Bases:
EvalableList[Rename]A list of renames.
- class accelforge.frontend.renames.Renames[source]#
Bases:
EvalableModel- einsums: list[EinsumRename]#
Renames for a workload. The Einsum list is a list of EinsumRename objects, and renames will be applied to Einsums whose names match the EinsumRename.name. If an EinsumRename is named “default”, then its renames are applied to every Einsum unless overridden. Overriding is specific to a single name, so every rename in the default must be overridden independently.
accelforge.frontend.spec module#
- class accelforge.frontend.spec.Spec[source]#
Bases:
EvalableModelThe top-level spec of all of the inputs to this package.
- calculate_component_area_energy_latency_leak(einsum_name=None, area=True, energy=True, latency=True, leak=True)[source]#
Populates per-component area, energy, latency, and/or leak power. For each component, populates the
area,total_area,leak_powerandtotal_leak_power. Additionally, for each action of each component, populates the<action>.energyand<action>.latencyfields. Extends thecomponent_modeling_logfield with log messages. Also populates thecomponent_modelattribute for each component if not already set.Some architectures’ attributes may depend on the workload. In that case, an Einsum name can be provided to populate those symbols with the Einsum’s symbols from the workload.
- Parameters:
einsum_name (EinsumName | None = None) – Optional Einsum name to populate symbols with the Einsum’s symbols from the workload. If None, and there are Einsums in the workload, the first Einsum is used. If None and there are no Einsums in the workload, then no symbols are populated from the workload.
area (bool, optional) – Whether to compute and populate area entries.
energy (bool, optional) – Whether to compute and populate energy entries.
latency (bool, optional) – Whether to compute and populate latency entries.
leak (bool, optional) – Whether to compute and populate leak power entries.
- Return type:
- map_workload_to_arch(einsum_names=None, print_number_of_pmappings=True, _pmapping_row_filter_function=None)[source]#
Maps the workload to the architecture using the AccelForge Fast and Fusiest Mapper (FFM).
- Parameters:
spec – The Spec to map.
einsum_names (
list[str] |None) – The einsum names to map. If None, all einsums will be mapped.can_combine_multiple_runs (Whether we would like to be able to combine multiple) – make_pmappings runs. Having this as True allows you to do things like pmappings = make_pmappings(*args_a) | make_pmappings(*args_b) but slows down execution.
cache_dir – The directory to cache pmappings in. If None, no caching will be done.
print_number_of_pmappings (
bool) – Whether to print the number of pmappings for each einsum._pmapping_row_filter_function (
Optional[Callable[[Series],bool]]) – A function that takes in a row of the pmapping dataframe and returns True if the row should be included in the final mappings, and False otherwise. If None, all rows will be included.
- Returns:
The mappings of the workload to the architecture.
- Return type:
- mapping: Mapping#
How the workload is programmed onto the architecture. Do not specify this if you’d like the mapper to generate a mapping for you.
accelforge.frontend.variables module#
accelforge.frontend.workload module#
All the objects used for a Workload description in AccelForge.
- class accelforge.frontend.workload.Einsum[source]#
Bases:
EvalableModelRepresents an Einsum, which is a single computation step in the workload. The Einsum includes a set of rank variables, which are used to index into tensors. Rank variables iterate through an iteration space.
For example, if the Einsum is A[m, n] += B[k, n] * C[k, n] and we define the iteration space as “0 <= m < 10, 0 <= n < 10, 0 <= k < 10”, then the Einsum will iterate through all possible values of (m, n, k) in the iteration space, indexing into tensors for each and updating A[m, n] with B[k, n] * C[k, n].
- __init__(*args, **kwargs)[source]#
Create a new model by parsing and validating input data from keyword arguments.
Raises [ValidationError][pydantic_core.ValidationError] if the input data cannot be validated to form a valid model.
self is explicitly positional-only to allow self as a field name.
- copy_source_tensor()[source]#
If this Einsum is a copy operation, returns the name of the tensor that is the source of the copy. Otherwise, returns None.
- property indexing_expressions: set[str]#
Returns a list of all the expressions that index into the tensors of this Einsum.
- is_copy_operation: bool#
Whether the Einsum is a copy operation. Copy operations take the input tensor and directly place them at the location of the output tensor(s) without any computation. If the destination tensor is at the same location, then this is a no-op.
- iteration_space_shape: Shape[str]#
Bounds of valid rank variable values. This is a list of expressions, each one an ISL expression. Additionally, global iteration_space_shape expressions are appended to the list if their rank variables are present in the Einsum’s rank_variables. For example, if the global scope has “m: 0 <= m < 10” and the Einsum has “m” in its rank_variables, then “0 <= m < 10” will be appended to the iteration_space_shape.
- n_instances: int#
Number of times to repeat the Einsum. Multiplied by Workload.n_instances to get the total number of Einsum instances. Energy, latency, and other summable metrics are multiplied by this value. Persistent reservations are also multiplied by this value, but non-persistent reservations are not, as they are assumed to be freed between each instance.
- rank_sizes: EvalableDict[str, int]#
Sizes of ranks. This is a dictionary of rank names to sizes. Sizes are integers, and the rank’s bounds are 0 <= rank < size. Accesses outside of these bounds are skipped.
- property rank_variable2ranks: dict[str, set[str]]#
Returns a dictionary of rank variables to the ranks that are indexed into by that rank variable.
- renames: RenameList[Rename]#
Renames of the Einsum. Renames here can be used to rename rank variables or tensors. When this Einsum is executed on an architecture, the architecture can use renamed tensors and rank variables to access the tensors and rank variables.
- property tensor2directly_indexing_rank_variables: dict[str, set[str]]#
Returns a dictionary of tensor names to the rank variables that directly index into that tensor. Direct indexing means that the rank variable is used as a direct index into the tensor, without any expression (e.g., “M=m”, NOT “M=m+n”).
- property tensor2expression_indexing_rank_variables: dict[str, set[str]]#
Returns a dictionary of tensor names to the rank variables that indirectly index into that tensor through an expression (e.g., “M=m+n”) instead of a direct index (e.g., “M=m”).
- property tensor2irrelevant_rank_variables: dict[str, set[str]]#
Returns a dictionary of tensor names to the rank variables that are irrelevant to that tensor. Irrelevant rank variables are rank variables that are not used to index into the tensor.
- property tensor2rank_variables: dict[str, set[str]]#
Returns a dictionary of tensor names to the rank variables that project into that tensor.
- tensor_accesses: EvalableList[TensorAccess]#
The tensors accessed by this Einsum, and how they are accessed.
- class accelforge.frontend.workload.ImpliedProjection[source]#
Bases:
dictHolds a projection that has been implied by a list of rank variables. The implied rank names are uppercased versions of the rank variables; for example, [a, b, c] -> {A: a, B: b, C: c}.
- class accelforge.frontend.workload.Shape[source]#
Bases:
EvalableListSpecifies valid values for the rank variables. This is a list of strings, each one an ISL expression. The total space is considered to be the logal AND of all the expressions in the list.
- class accelforge.frontend.workload.TensorAccess[source]#
Bases:
EvalableModelInformation about how an Einsum accesses a tensor.
- backing_storage_size_scale: float#
If != 1, then the backing storage size will be scaled by this factor.
- property directly_indexing_rank_variables: set[str]#
Returns the rank variables that directly index into this tensor without any expression (e.g., “M=m”, NOT “M=m+n”).
- property expression_indexing_rank_variables: set[str]#
Returns the rank variables that indirectly index into this tensor through an expression (e.g., “M=m+n”) instead of a direct index (e.g., “M=m”).
- persistent: bool#
If True, then a copy of this tensor must remain in backing storage for the full duration of the workload’s execution.
- projection: dict[str, str] | list[str]#
How the rank variables of the Einsum project into the tensor. If this is a list, then it is assumed that each of the elements of the list is a single rank variable and they index into the tensor in ranks that equal the uppercase of the rank variable. For example:
name: X, projection: [a, b, c] means X[A=a, B=b, C=c]
If this is a dictionary, it is a mapping from rank names to rank variable expressions. This can be used to either project into a non-matching rank name or to project into a tensor using an expression. For example:
name: X, projection: {A: a, B2: b, C: a+b} means X[A=a, B2=b, C=a+b]
- property rank2rank_variables: dict[str, set[str]]#
Returns a dictionary of rank names to the rank variables that project into that rank.
- class accelforge.frontend.workload.Workload[source]#
Bases:
EvalableModelThe workload specification as a cascade of Einsums, with each Einsum being a computation step in the workload.
- accesses_for_tensor(tensor)[source]#
Returns all TensorAccess objects that access the given tensor across all Einsums.
- Parameters:
tensor (TensorName) – The tensor to check.
- Returns:
The TensorAccess objects that access the given tensor across all Einsums. Order is the same as the order in this workload’s Einsums list.
- Return type:
- bits_per_value: EvalableDict[str, int | str]#
Bits per value for each tensor. The workload-level bits_per_value is overridden if bits_per_action is specified for any given tensor access. This is a dictionary of set expressions to bits per value for the tensors given by those expressions. For example, we may write “Inputs: 8” to set the bits per value to 8 for all input tensors, unless overridden.
- einsums_with_tensor(tensor)[source]#
Returns the Einsums in the workload that access the given tensor.
- Parameters:
tensor (TensorName) – The tensor to check.
- Returns:
The Einsums in the workload that access the given tensor. Order is the same as the order in this workload’s Einsums list.
- Return type:
- einsums_with_tensor_as_input(tensor)[source]#
Returns the Einsums in the workload that use the given tensor as an input.
- Parameters:
tensor (TensorName) – The tensor to check.
- Returns:
The Einsums in the workload that use the given tensor as an input. Order is the same as the order in this workload’s Einsums list.
- Return type:
- einsums_with_tensor_as_output(tensor)[source]#
Returns the Einsums in the workload that have the given tensor as an output.
- Parameters:
tensor (TensorName) – The tensor to check.
- Returns:
The Einsums in the workload that have the given tensor as an output. Order is the same as the order in this workload’s Einsums list.
- Return type:
- get_iteration_space_shape_isl_string(einsum_name)[source]#
Returns the ISL string representing the iteration space of the given Einsum.
- get_tensor_copies()[source]#
Returns a dictionary specifying which tensors are copies of which other tensors. For example, if einsum A copies tensor X into tensors Y and Z, then we’d have in the return value X: {Y, Z}, Y: {X, Z}, and Z: {X, Y}. This is transitive.
- Returns:
A dictionary specifying which tensors are copies of which other tensors. The keys are the tensors that are copies, and the values are sets of tensors that are copies of the key.
- Return type:
- iteration_space_shape: EvalableDict[str, str]#
Bounds of valid rank variable values. This is a dictionary of rank variable names to bounds of valid rank variable values. The bounds are specified as a string in the ISL format. For example, “0 <= a < 10” means that the rank variable a must be between 0 and 10, including 0 but not 10. Bounds are included for all Einsums that include that rank variable.
- n_instances: int#
Number of times to repeat the workload. Multiplied by Einsum.n_instances to get the total number of Einsum instances. Energy, latency, and other summable metrics are multiplied by this value. Persistent reservations are also multiplied by this value, but non-persistent reservations are not, as they are assumed to be freed between each instance.
- accelforge.frontend.workload.isl_expression_has_variable(expression, variable)[source]#
Returns True if the given ISL expression has the given rank variable.
- Parameters:
expression (str) – The ISL expression to check.
variable (RankVariable) – The rank variable to check for.
- Returns:
True if the given ISL expression has the given rank variable.
- Return type:
Module contents#
Timeloop Spec. Each piece below (minus processors) corresponds to a top key in the Timeloop spec.
- class accelforge.frontend.Spec[source]#
Bases:
EvalableModelThe top-level spec of all of the inputs to this package.
- calculate_component_area_energy_latency_leak(einsum_name=None, area=True, energy=True, latency=True, leak=True)[source]#
Populates per-component area, energy, latency, and/or leak power. For each component, populates the
area,total_area,leak_powerandtotal_leak_power. Additionally, for each action of each component, populates the<action>.energyand<action>.latencyfields. Extends thecomponent_modeling_logfield with log messages. Also populates thecomponent_modelattribute for each component if not already set.Some architectures’ attributes may depend on the workload. In that case, an Einsum name can be provided to populate those symbols with the Einsum’s symbols from the workload.
- Parameters:
einsum_name (EinsumName | None = None) – Optional Einsum name to populate symbols with the Einsum’s symbols from the workload. If None, and there are Einsums in the workload, the first Einsum is used. If None and there are no Einsums in the workload, then no symbols are populated from the workload.
area (bool, optional) – Whether to compute and populate area entries.
energy (bool, optional) – Whether to compute and populate energy entries.
latency (bool, optional) – Whether to compute and populate latency entries.
leak (bool, optional) – Whether to compute and populate leak power entries.
- Return type:
- map_workload_to_arch(einsum_names=None, print_number_of_pmappings=True, _pmapping_row_filter_function=None)[source]#
Maps the workload to the architecture using the AccelForge Fast and Fusiest Mapper (FFM).
- Parameters:
spec – The Spec to map.
einsum_names (
list[str] |None) – The einsum names to map. If None, all einsums will be mapped.can_combine_multiple_runs (Whether we would like to be able to combine multiple) – make_pmappings runs. Having this as True allows you to do things like pmappings = make_pmappings(*args_a) | make_pmappings(*args_b) but slows down execution.
cache_dir – The directory to cache pmappings in. If None, no caching will be done.
print_number_of_pmappings (
bool) – Whether to print the number of pmappings for each einsum._pmapping_row_filter_function (
Optional[Callable[[Series],bool]]) – A function that takes in a row of the pmapping dataframe and returns True if the row should be included in the final mappings, and False otherwise. If None, all rows will be included.
- Returns:
The mappings of the workload to the architecture.
- Return type:
- mapping: Mapping#
How the workload is programmed onto the architecture. Do not specify this if you’d like the mapper to generate a mapping for you.