#OpenMAX IL Introduction and Architecture This section of the document describes the OpenMAX IL features and architecture.
##2.1 OpenMAX IL Description The OpenMAX IL layer is an API that defines a software interface used to provide an access layer around software components in a system. The intent of the software interface is to take components with disparate initialization and command methodologies and provide a software layer that has a standardized command set and a standardized methodology for construction and destruction of the components.
###2.1.1 Architectural Overview Consider a system that requires the implementation of four multimedia processing functions denoted as F1, F2, F3, and F4. Each of these functions may be from different vendors or may be developed in house but by different groups within the organization. Each may have different requirements for setup and teardown. Each may have different methods of facilitating configuration and data transfer. The OpenMAX IL API provides a means of encapsulating these functions, singly or in logical groups, into components.
The API includes a standard protocol that enables compliant components that are potentially from different vendors/groups to exchange data with one another and be used interchangeably.
The OpenMAX IL API interfaces with a higher-level entity denoted as the IL client, which is typically a functional piece of a filter graph multimedia framework or an application. The IL client interacts with a centralized IL entity called the core. The IL client uses the OpenMAX core for loading and unloading components, setting up direct communication between two OpenMAX components, and accessing the component’s method functions.
An IL client always communicates with a component via the IL core. In most cases, this communication equates to calling one of the IL core’s macros, which translates directly to a call on one of the component methods. Exceptions (where the IL client calls an actual core function that works) include component creation and destruction and connection via tunneling of two components.
Components embody the media processing function or functions. Although this specification clearly defines the functionality of the OpenMAX core, the component provider defines the functionality of a given component. Components operate on four types of data that are defined according to the parameter structures that they export: audio, video, image, and other (e.g., time data for synchronization).
An OpenMAX component provides access to a standard set of component functions via its component handle. These functions allow a client to get and set component and port configuration parameters, get and set the state of the component, send commands to the component, receive event notifications, allocate buffers, establish communications with a single component port, and establish communication between two component ports.
Every OpenMAX component shall have at least one port to claim OpenMAX conformance. Although a vendor may provide an OpenMAX-compatible component without ports, the bulk of conformance testing is dependent on at least one conformant port. The four types of ports defined in OpenMAX correspond to the types of data a port may transfer: audio, video, and image data ports, and other ports. Each port is defined as either an input or output depending on whether it consumes or produces buffers. In a system containing four multimedia processing functions F1, F2, F3, and F4, a system implementer might provide a standard OpenMAX interface for each of the functions. The implementer might just as easily choose any combination of functions. The delineation for the separation of this functionality is based on ports. Figure 2-1 shows a few possible partitions for an OpenMAX implementation that provides these functions.
Figure 2-1. Possible Partitions for an OpenMAX Implementation
###2.1.2 Key Vocabulary This section describes acronyms and definitions commonly used in describing the OpenMAX IL API.
####2.1.2.1 Acronyms Table 2-1 lists acronyms commonly used in describing the OpenMAX IL API.
| Acronym | Meaning | | ------------- | | IPC | Abbreviation of inter-processor communication.| | OMX | Used as a prefix in the names of OpenMAX functions and structures. For example, a component may be place in the OMX_StateExecuting state.|
Table 2-1. Acronyms
####2.1.2.2 Key Definitions Table 2-2 lists key definitions used in describing the OpenMAX IL API.
| Key word | Meaning | | ------------- | | Accelerated component | OpenMAX components that wrap a function with a portion running on an accelerator. Accelerated components have special characteristics such as being able to support some types of tunneling. | | Accelerator | Hardware designed to speed up processing of some functions.This hardware may also be referred to as accelerated hardware.Note that the accelerator may actually be software running in adifferent processor and not be hardware at all. | | AMR | Abbreviation of adaptive multimedia retrieval, which is an adaptive multi-rate codec from the 3GGP consortium. | | Host processor | The processor in a multi-core system that controls media acceleration and typically runs a high-level operating system. | | IL client | The layer of software that invokes the methods of the core or component. The IL client may be a layer below the GUI application, such as GStreamer, or may be several layers below the GUI layer. In this document, the application refers to any software that invokes the OpenMAX methods. | | Main memory | Typically external memory that the host processor and the accelerator share. | | OpenMAX component | A component that is intended to wrap functionality that is required in the target system. The OpenMAX wrapper provides a standard interface for the function being wrapped. | | OpenMAX core | Platform-specific code that has the functionality necessary to locate and then load an OpenMAX component into main memory. The core also is responsible for unloading the component from memory when the application indicates that the component is no longer needed. In general, after the OpenMAX core loads a component into memory, the core will not participate in communication between the application and the component.| | Resource manager | A software entity that manages hardware resources in the system. | | RTP | Abbreviation of real-time protocol, which is the Internet-standard protocol for the transport of real-time data, including audio and video. | | Synchronization | A mechanism for gating the operation of one component with another. | | Tunnels/Tunneling | The establishment and use of a standard data path that is managed directly between two OpenMAX components. |
Table 2-2. Key Definitions ###2.1.3 System Components Figure 2-2 depicts the various types of communication enabled with OpenMAX. Each component can have an arbitrary number of ports for data communication. Components with a single output port are referred to as source components. Components with a single input port are referred to as sink components. Components running entirely on the host processor are referred to as host components. Components running on a loosely coupled accelerator are referred to as accelerator components. OpenMAX may be integrated directly with an application or may be integrated with multimedia framework components enabling heterogeneous implementations.
Three types of communication are described. Non-tunneled communications defines a mechanism for exchanging data buffers between the IL client and a component. Tunneling defines a standard mechanism for components to exchange data buffers directly with each other in a standard way. Proprietary communication describes a proprietary mechanism for direct data communications between two components and may be used as an alternative when a tunneling request is made, provided both components are capable of doing so.
Figure 2-2. OpenMAX IL API System Components ####2.1.3.1 Component Profiles OpenMAX component functionality is grouped into two profiles: base profile and interop profile.
The base profile shall support non-tunneled communication. Base profile components may support proprietary communication. Base profile components do not support tunneled communication.
The interop profile is a superset of the base profile. An interop profile component shall support non-tunneled communication and tunneled communication. An interop profile component may support proprietary communication.
The primary difference between the interop profile and the base profile is that the component supports tunneled communication. The base profile exists to reduce the adoption barrier for OpenMAX implementers by simplifying the implementation. A base profile component does not need to implement tunneled communication.
###2.1.4 Component States Each OpenMAX component can undergo a series of state transitions, as depicted in Figure 2-3. Every component is first considered to be unloaded. The component shall be loaded through a call to the OpenMAX core. All other state transitions may then be achieved by communicating directly with the component.
A component can enter an invalid state when a state transition is made with invalid data. For example, if the callback pointers are not set to valid locations, the component may time out and alert the IL client of the error. The IL client shall stop, de-initialize, unload, and reload the component when the IL client detects an invalid state. Figure 2-3 depicts the invalid state as enterable from any state, although the only way to exit the invalid state is to unload and reload the component.
Figure 2-3. Component States
Transitioning into the IDLE state may fail since this state requires allocation of all operational resources. When the transition from LOADED to IDLE fails, the IL client may try again or may choose to put the component into the WAIT FOR RESOURCES state. Upon entering the WAIT FOR RESOURCE state, the component registers with a vendor-specific resource manager to alert it when resources have become available. The resource manager subsequently puts the component into the IDLE state. A command that the IL client sends controls all other state transitions except to INVALID.
The IDLE state indicates that the component has all of its needed resources but is not processing data. The EXECUTING state indicates that the component is pending reception of buffers to process data and will make required callbacks as specified in section 3. The PAUSED state maintains a context of buffer execution with the component without processing data or exchanging buffers. Transitioning from PAUSED to EXECUTING enables buffer processing to resume where the component left off.
Transitioning from EXECUTING or PAUSED to IDLE will cause the context in which buffers were processed to be lost, which requires the start of a stream to be reintroduced. Transitioning from IDLE to LOADED will cause operational resources such as communication buffers to be lost.
###2.1.5 Component Architecture Figure 2-4 depicts the component architecture. Note that there is only one entry point for the component (through its handle to an array of standard functions) but there are multiple possible outgoing calls that depend on how many ports the component has. Each component will make calls to a specified IL client event handler. Each port will also make calls (or callbacks) to a specified external function. A queue for pointers to buffer headers is also associated with each port. These buffer headers point to the actual buffers. The command function also has a queue for commands. All parameter or configuration calls are performed on a particular index and include a structure associated with that parameter or configuration, as depicted in Figure 2-4.
Figure 2-4. OpenMAX IL API Component Architecture
A port must support callbacks to the IL client and, when part of an interop profile component, must support communication with ports on other components.
###2.1.6 Communication Behavior
Configuration of a component may be accomplished once the handle to the component has been received from the OpenMAX core. Data communication calls with a component are non-blocking and are enabled once the number of ports has been configured, each port has been configured for a specific data format, and the component has been put in the appropriate state. Data communication is specific to a port of the component. Input ports are always called from the IL client with OMX_EmptyThisBuffer
(for more information, see section 3.2.2.17). Output ports are always called from the IL client with
OMX_FillThisBuffer
(for more information, see section 3.2.2.18). In an in-context implementation, callbacks to OMX_EmptyBufferDone
or OMX_FillBufferDone
will be made before the return. Figure 2-5 depicts the anticipated behavior for an in- context versus an out-of-context implementation. Note that the IL client should not make assumptions about return/callback sequences to enable heterogeneous integration of in-context and out-of-context OpenMAX components.
Figure 2-5. Out-of-Context versus In-Context Operation
Data communications with components is always directed to a specific component port. Each port has a component-defined minimum number of buffers it shall allocate or use. A port associates a buffer header with each buffer. A buffer header references data in the buffer and provides metadata associated with the contents of the buffer. Every component port shall be capable of allocating its own buffers or using pre-allocated buffers; one of these choices will usually be more efficient than the other.
###2.1.7 Tunneled Buffer Allocation and Sharing This section describes buffer allocation for tunneling components and buffer sharing. For a given tunnel, exactly one port supplies the buffers and passes those buffers to the non-supplier port. In the simplest case, the supplier also allocates the buffers. Under the right circumstances, however, a tunneling component may choose to re-use buffers from one port on another to avoid memory copies and optimize memory usage. This practice is known as buffer sharing.
A tunnel between any two ports represents a dependency between those ports. Buffer sharing extends that dependency so that all ports that share the same set of buffers form an implicit dependency chain. Exactly one port in that dependency chain allocates the buffers shared by all of them.
Buffer sharing is implemented within a component and is transparent to other components. The non-supplier port is unaware whether the supplier’s component allocated the buffers itself or re-used buffers from another of its ports. Furthermore, the supplier is unaware of whether the non-supplier’s component will re-use the buffers that the supplier provided.
Strictly speaking, a component is only obligated to obey the external semantics required of it and may implement buffer sharing behind those semantics. More specifically, external semantics require that a component do the following:
- Provide buffers on all of its supplier ports.
- Accurately communicate buffer requirements on its ports.
- Pass a buffer from an output port to an input port with an OMX_EmptyThisBuffer call.
- Return a buffer from an input port to an output port with an OMX_FillThisBuffer call.
If a component chooses to share buffers, its implementation may fulfill those requirements by doing the following:
- Provide re-used buffers on some supplier ports.
- Account for the needs of shared ports when communicating buffer requirements on ports.
- Internally pass a buffer from an input port to an output port between an OMX_EmptyThisBuffer call and its corresponding
OMX_EmptyBufferDone
call.
OpenMAX defines external component semantics to be compatible with sharing, although it does not explicitly require that a component support sharing. This section discusses the implementation of those semantics in the context of buffer sharing. If no components are sharing buffers, the implementation reduces to a simpler set of steps and obligations.
####2.1.7.1 Relevant Terms This section describes terms used in discussions of tunneled buffer allocation and sharing.Figure 2-6 illustrates the concepts.
Figure 2-6. Example of Buffer Allocation and Sharing Relationships
Among a pair of ports that are tunneling, the port that calls UseBuffer on its neighbor is known as a supplier port. A buffer supplier port does not necessarily allocate its buffers; it may re-use buffer from another port on the same component. Ports a and c in Figure 2-6 illustrate supplier ports.
The port that receives the UseBuffer calls from its neighbor is known as a non-supplier port. Ports b and d Figure 2-6 illustrate non-supplier ports.
A port’s tunneling port is the port neighboring it with which it shares a tunnel. For example, port b in Figure 2-6 is the tunneling port to port a. Likewise, port a is the tunneling port to port b.
An allocator port is a supplier port that also allocates its own buffers. Port a in Figure 2-6 is the only allocator port.
A sharing port is a port that re-uses buffers from another port on the same component. For example, port c in Figure 2-6 is a sharing port.
A tunneling component is a component that uses at least one tunnel.
The set of buffer requirements for a port includes the number of buffers required and the required size of each buffer. The maximum of multiple sets of buffer requirements is defined as the largest number of buffers mandated in any set combined with the largest size mandated in any set. One port retrieves buffer requirements from its tunneled port in a OMX_PORTDEFINITIONTYPE structure via an OMX_GetParameter call on the tunneled port's component. Note that one port may determine buffer requirements from a port that shares its buffers without resorting to an OMX_GetParameter call since they are both contained in the same component.
####2.1.7.2 IL Client Component Setup To set up tunneling components, the IL client shall perform the following setup operations in this order:
- Load all tunneling components and set up the tunnels on these components.
- Command all tunneling components to transition from the loaded state to the idle state.
If the IL client does not operate in this manner, a tunneling component might never transition to idle because of the possible dependencies between components.
####2.1.7.3 Component Transition from Loaded to Idle State with Sharing During the OMX_SetupTunnel call, the two ports of a tunnel establish which port (input or output) will act as the buffer supplier. Thus, when a component is commanded to transition from loaded to idle, it is aware of the roles of all its supplier or non-supplier ports.
When commanded to transition from loaded to idle, a component performs the following operations in this order:
-
The component determines what buffering sharing it will implement, if any. The following rules apply:
- a) A component may re-use a buffer only from one of its one input ports on one or more of its output ports or from one of its output ports on one of its input ports.
- b) Only a supplier port may re-use the buffers from another port.
- c) A component sharing buffers over multiple output ports requires read-only output port as shown in Figure 2-7.
Figure 2-7. Possible Sharing Relationships
- The component determines which of its supplier ports, if any, are also allocator ports. A supplier port is also an allocator port only if it does not re-use buffers from a non-supplier port on the same component (i.e., is not a sharing port). In Figure 2-8, a supplier port is a port with an arrow pointing away. A non-supplier port is a port with an arrow pointing toward it. An arrow from one port represents a sharing relationship. A port with boxes (buffers) adjacent to it represents an allocator port.
Figure 2-8. Determining Allocators
- The component allocates its buffers for each of its allocator ports as follows:
- a) For each port that re-uses the allocator ports buffer, the allocator port determines the buffer requirements of the sharing port. See obligation A below.
- b) The allocator port determines the buffer requirements of its tunneled port via an
OMX_GetParameter
call. See obligation B below. - c) The allocator port allocates buffers according to the maximum of its own requirements, the requirements of the tunneled port, and the requirement of all of the sharing ports.
- d) The allocator port informs the non-supplier port that it is tunneling with of the actual number of buffers via an
OMX_SetParameter
call onOMX_IndexParamPortDefinition
by setting the value ofnBufferCountActual
appropriately. See obligation E below. - e) The allocator port shares its buffers with each sharing port that re-uses its buffers. See obligation D below.
- f) For every allocated buffer, the allocator port calls
OMX_UseBuffer
on its tunneling port. See obligation C below.
A component shall also fulfill the following obligations:
- A. For a sharing port to determine its requirements, the sharing port shall first call
OMX_GetParameter
on its tunneled port to query for requirements and then return the maximum of its own requirements and the requirements of the tunneled ports. - B. When a non-supplier port receives an
OMX_GetParameter
call querying its buffer requirements, the non-supplier port shall first determine the requirements of all ports that re-use its buffers (see obligation A) and then return the maximum of its own requirements and those of its ports. - C. When a non-supplier port receives an
OMX_UseBuffer
call from its tunneled port, the non-supplier port shall share the buffer with all ports on that component that re-use it. - D. When a port A shares a buffer with a port B on the same component where port B re-uses the buffer of port A, then port B shall call
OMX_UseBuffer
and pass the buffer on its tunneled port. - E. When a non-supplier port receives a
OMX_SetParameter
call onOMX_IndexParamPortDefinition
from its tunneled port, the non-supplier port shall pass the nBufferCountActual field to any port that re-uses its buffers.
Likewise, each supplier port that receives the nBufferCountActual
field in this way shall pass the nBufferCount
to its tunneled port by performing an OMX_SetParameter
call on OMX_IndexParamPortDefinition
. The actual number of buffers used throughout the dependency chain is propagated in this way.
A component may transition from loaded to idle when all enabled ports have all the buffers they require.
In practice, there could be a direct mapping between the following:
- Steps 1-3 discussed earlier and code in the loaded-to-idle case in the state transition handler
- Obligation A and a subroutine to determine a shared ports buffer requirements
- Obligation B and the OMX_GetParameter implementation
- Obligation C and the OMX_UseBuffer implementation
- Obligation D and a subroutine to share a buffer from one port to another
To clarify why conformity to these steps and obligations leads to proper buffer allocation, consider the example illustrated in Figure 2-9. Note that this example is contrived to exercise every step and obligation outlined above, and is therefore more complex then most real use cases.
Figure 2-9. Example of Buffer Allocation
This discussion focuses only on the transition of component 3 to idle; similar operations occur inside the other components.
When the IL client commands component 3 to transition from loaded to idle, it follows the following prescribed steps:
- Component 3 notices that it can re-use port d’s buffers since port e is a supplier port. Component 3 establishes a sharing relationship from port d to port e.
- Component 3 decides that since port d is a supplier port that does not re-use buffers, port d shall be an allocator port.
- Component 3 allocates and distributes port d’s buffers:
- a) Since port e will re-use the buffer of port d, component 3 determines the buffer requirements of port e. In accordance with obligation A, port e calls
OMX_GetParameter
on port f to determine its buffer requirements and reports the requirements as the maximum between its own and those of port f. - b) Port d calls
OMX_GetParameter
on port c to determine its buffer requirements. In accordance with obligation B, port c shall determine the buffer requirements of port b. In accordance with obligation A, port b returns the maximum of its own requirements and the requirement of port a (retrieved viaOMX_GetParameter
) when queried. Port c then returns the maximum of its own requirements and the requirements that port b returns. - c) Port d allocates buffers according to the maximum of its own requirements and the requirements that ports c and e return. The resulting buffers are effectively allocated according to the maximum requirements of ports a, b, c, d, e, and f, all of which use the buffers of port d.
- d) Since port e will re-use the buffers of port d, component 3 shares these buffers with port e. In accordance with obligation D, port e calls
OMX_UseBuffer
on port f for every buffer that is shared. - e) For each buffer allocated, port d calls
OMX_UseBuffer
on port c. In accordance with obligation C, port c shares each buffer with port b. Port b, in turn, obeys obligation D and callsOMX_UseBuffer
on port a with the buffer.
- a) Since port e will re-use the buffer of port d, component 3 determines the buffer requirements of port e. In accordance with obligation A, port e calls
Since all ports of all components now have their buffers, all components may transition to idle.
####2.1.7.4 Protocol for Using a Shared Buffer When an input port receives a shared buffer via an OMX_EmptyThisBuffer call, the input port may re-use that buffer on an output port that it is sharing with the output port by obeying the following rules:
- The output port calls
OMX_EmptyThisBuffer
on its tunneling port before the input port sends the correspondingOMX_EmptyBufferDone
call to its tunneling port. - The input port does not call
OMX_EmptyBufferDone
until all output ports on which the buffer is shared (i.e., viaOMX_EmptyThisBuffer
calls) returnOMX_EmptyBufferDone
.
####2.1.7.5 Component Transition from Loaded to Idle State without Sharing If a component does not share buffers, the component implementation reduces to a simpler set of steps and obligations than the case for sharing buffers.
When commanded to transition from loaded to idle, a non-sharing component performs the following operations in this order:
- The component determines what buffering sharing it will implement, if any. In this case, there is no sharing.
- The component determines which of its supplier ports, if any, are also allocator ports. All supplier ports are allocator ports.
- The component allocates it buffers for each allocator port as follows:
- a. Since there is no sharing, the component does not ask the sharing port for requirements.
- b. The allocator determines the buffer requirements of its tunneled port via an
OMX_GetParameter
call. - c. The allocator allocates buffers according to the maximum of its own requirements and the requirements of the tunneled ports.
- d. Since there is no sharing, no buffers must be passed to sharing ports.
- e. For every allocated buffer, the allocator port calls
OMX_UseBuffer
on its tunneling port.
All component obligations described for sharing components do not apply to non-sharing components.
###2.1.8 Port Reconnection Port reconnection enables a tunneled component to be replaced with another tunneled component without having to tear down surrounding components. In Figure 2-10, component B1 is to be replaced with component B2. To do this, the component A output port and the component B input port shall first be disabled with the port disable command. Once all allocated buffers have returned to their rightful owner and freed, the component A output port may be connected to component B2. The component B1 output port and the component C input port should similarly be given the port disable command. After all allocated buffers have returned to their owners and freed, the component C input port may be connected to the component B2 output port. Then all ports may be given the enable command.
Figure 2-10. Port Reconnection
In some cases such as audio, reconnecting one component to another and then fading in data for one component while fading out data for the original component may be desirable. Figure 2-11 illustrates how this would work. In step 1, component A sends data to component B1, which then sends the data on to component C. Components A and C both have an extra port that is disabled. In step 2, the IL client first establishes a tunnel between component A and B2, then establishes a tunnel between B2 and C, and then enables all ports in the two tunnels. Component C may be able to mix data from components B1 and B2 at various gains, assuming that these are audio components. In step 3, the ports connected to component B1 from components A and C are disabled, and component B1 resources may be de-allocated.
Figure 2-11. Reconnecting Components
###2.1.9 Queues and Flush A separate command queue enables the component to flush buffers that have not been processed and return these buffers to the IL client when using non-tunneled communication, or to the tunneled port when using tunneled communication. In Figure 2-12, assume that the component has an output port that is using buffers allocated by the IL client. In this example, the client sends a series of five buffers to the component before sending the flush command. Upon processing the flush command, the component returns each unprocessed buffer in the original order, and finally triggers its event handler to notify the IL client. Two buffers were already processed before the flush command got processed. The component returns the remaining three buffers unfilled and generates an event. The IL client should wait for the event before attempting to de-initialize the component.
Figure 2-12. Flushing Buffers
###2.1.10 Marking Buffers An IL client can also trigger an event to be generated when a marked buffer is encountered. A buffer can be marked in its buffer header. The mark is internally transmitted from an input buffer to an output buffer in a chain of OpenMAX components. The mark enables a component to send an event to the IL client when the marked buffer is encountered. Figure 2-13 depicts how this works.
Figure 2-13. Marking Buffers The IL client sends a command to mark a buffer. The next buffer sent from the output port of the component is marked B1. Component B processes the B1 buffer and provides the results in buffer B2 along with the mark. When component C receives the marked buffer B2 through its input port, the component does not trigger its event handler until it has processed the buffer.
###2.1.11 Events and Callbacks Six kinds of events are sent by a component to the IL client:
- Error events are enumerated and can occur at any time
- Command complete notification events are triggered upon successful execution of a command.
- Marked buffer events are triggered upon detection of a marked buffer by a component.
- A port settings changed notification event is generated when the component changes its port settings.
- A buffer flag event is triggered when an end of stream is encountered.
- A resources acquired event is generated when a component gets resources that it has been waiting for.
Ports make buffer handling callbacks upon availability of a buffer or to indicate that a buffer is needed.
###2.1.12 Buffer Payload The port configuration is used to determine and define the format of the data to be transferred on a component port, but the configuration does not define how that data exists in the buffer.
There are generally three cases that describe how a buffer can be filled with data. Each case presents its own benefits.
In all cases, the range and location of valid data in a buffer is defined by the pBuffer, nOffset, and nFilledLength parameters of the buffer header. The pBuffer parameter points to the start of valid data in the buffer. The nOffset parameter indicates the number of bytes between the start of the buffer and the start of valid data. The nFilledLength parameter specifies the number of contiguous bytes of valid data in the buffer. The valid data in the buffer is therefore located in the range pBuffer + nOffset to pBuffer + nOffset + nFilledLength.
The following cases are representative of compressed data in a buffer that is transferred into or out of a component when decoding or encoding. In all cases, the buffer just provides a transport mechanism for the data with no particular requirement on the content. The requirement for the content is defined by the port configuration parameters.
The shaded portion of the buffer represents data and the white portion denotes no data.
Case 1: Each buffer is filled in whole or in part. In the case of buffers containing compressed data frames, the frames are denoted by f1 to fn.
Case 1 provides a benefit when decoding for playback. The buffer can accommodate multiple frames and reduce the number of transactions required to buffer an amount of data for decoding. However, this case may require the decoder to parse the data when decoding the frames. It also may require the decoder component to have a frame-building buffer in which to put the parsed data or maintain partial frames that would be completed with the next buffer.
Case 2: Each buffer is filled with only complete frames of compressed data.
Case 2 differs from case 1 because it requires the compressed data to be parsed first so that only complete frames are put in the buffers. Case 2 may also require the decoder component to parse the data for decoding. This case may not require the extra working buffer for parsing frames required in case 1.
Case 3: Each buffer is filled with only one frame of compressed data.
The benefit in case 3 is that a decoding component does not have to parse the data. Parsing would be required at the source component. However, this method creates a bottleneck in data transfer. Data transfer would be limited to one frame per transfer. Depending on the implementation, one transaction per frame could have a greater impact on performance than parsing frames from a buffer.
At a minimum, a decoder or encoder component would be required to support case 1. By definition, if a codec component can support case 1, then it can support cases 2 and 3, but only if the compression format allows for byte-aligned frame boundaries. Operating in case 2 or 3 may not make sense when, for example, configuring an Adaptive Multi-Rate (AMR) codec for RTP-payload format, bandwidth-efficient mode. The non-byte aligned frames defined by this format would not fit the byte-aligned frame boundaries defined by these cases.
When filling a buffer with compressed data for input to a decoder or output from an encoder, a problem with limiting the filling to complete frames only might arise when frames are not byte aligned. Padding would have to be added outside of any padding defined in the format specification. The padding would then need to be removed, since the data could not be appended as is. This would require knowledge of the padding bits outside of any standard specification. Likewise, if this padding were not in place to maintain compliance with the standards specification for the port configuration, complete frames could not always be placed in the buffers. In either case, specific knowledge of how this situation is handled would be required, and may be different between components.
For interoperability, the content delivered in a buffer should not be assumed or required to be any number of complete frames, although at least one complete unit of data will be delivered in a buffer for uncompressed data formats. Compressed data formats do not place restrictions on the amount of content delivered in each buffer.
###2.1.13 Buffer Flags and Timestamps Buffer flags associate certain properties (e.g., the end of a data stream) with the data contained in a buffer. A buffer timestamp associates a presentation time in microseconds with the data in the buffer used to time the rendering of that data. Once a timestamp is associated with a buffer, no component should alter the timestamp for rate control or synchronization, which are implemented in the clock component.
Buffer metadata (i.e., flags and timestamps) applies to the first new logical unit in the buffer. Thus, given the presence of multiple logical units in a buffer, the metadata applies to the logical unit whose starting boundary occurs in the buffer. Unless otherwise stated (e.g., in a flag definition), a component that receives a logical input unit marked with a flag or timestamp shall copy that metadata to all logical output units that the input contributes to.
###2.1.14 Synchronization Synchronization is enabled by the use of synchronization (sync) ports on a clock component. These ports and the clock component are defined within the “other” domain and operate with the same protocols and calls that regulate data ports. The clock component maintains a media clock that tracks the position in the media stream based on audio and video reference clocks. The clock component transmits buffers containing time information (denoted by a media time update and containing the media clock’s current position, scale, and state) to client components via sync ports. A client component may time the execution of an operation (e.g., the presentation of a video frame) to a timestamp by requesting that the clock component send that timestamp when it matches the media clock. In this case, the client component executes the operation when it receives the fulfillment of the request over its sync port. Figure 2-14 illustrates the flow of time and data buffers in an example configuration of components.
Figure 2-14. Flow of Time and Data Buffers ###2.1.15 Rate Control The clock component also implements all rate control by exposing a set of configurations for controlling its media clock. The IL client may change the scale factor of the media clock (effectively changing the rate and direction that the media clock advances) to implement play, fast forward, rewind, pause, and slow motion trick modes. The IL client may also start and stop the clock by using these configurations to change the state of the media clock. The clock component makes all of its client components aware of a change to the media clock scale and state by sending a media time update with the new scale or state on all sync ports. Although a component may not alter a buffer timestamp in reaction to a scale change, a component may alter its processing accordingly. For instance, an audio component might scale and pitch correct audio during trick modes or cease transmitting output entirely.
###2.1.16 Component Registration How components are registered with a core is generally core specific.
However, if the core supports static linking with components, then it will support a standard compile-time component registration scheme as described in section 3. Vendors can therefore supply components that are suitable for static linking with all cores that support it; this is achieved by placing component information into a data structure that is linked with the component and the core.
A component can be registered statically using this mechanism but have the bulk of its code dynamically loaded.
###2.1.17 Resource Management This section discusses the role of resource management in the OpenMAX IL API.
####2.1.17.1 Need for Resource Management When a component is not allowed to go to idle state due to lack of resources, the IL client has cannot know what the limited resource is or which components are using that resource. Therefore, the IL client cannot, for example, free up resources for a mandatory audio stream to play without turning off all of the IL components or having specific knowledge of IL component implementations, neither of which is a viable option. These situations necessitate IL resource management.
One of the goals of OpenMAX is hardware independence provided by the IL layer to the layers above it. The goal of hardware independence can be achieved by specifying the following requirements regarding resource management:
- An IL client (e.g., a multimedia plug-in that is typically part of a software platform) should not need to know the details of an IL implementation or which resource an IL component is using. For example, the IL client might have no information on whether a component is hardware accelerated or not.
- In case of resource conflicts, an IL client should be able to rely on consistent component behavior across IL implementations and hardware platforms.
- An IL client should not have to interface directly with a hardware vendor-specific resource manager for two reasons.
- This method violates the goal of hardware independence.
- This method adds considerable re-work to the IL client, which has an impact on the re-usability of the IL client on multiple hardware platforms.
Although resource management is not fully addressed in OpenMAX IL API version 1.0, “hooks” for resource management have been put in place in the form of behavioral rules, component priorities, and a resource management-related component state. These “hooks” lay the groundwork for full-fledged resource management in later versions of the OpenMAX IL API.
Before proceeding further, the terms resource management and policy are defined for the benefit of the discussion that follows:
- Resource management is responsible for managing the access of components to a limited resource. A resource manager will be aware of how much of a specific resource is available, which components are currently using the resource, and how much of the resource the components are using. A resource manager will recommend to policy which components should be pre-empted or resumed based on resource conflicts and availability.
- Policy is responsible for managing component chains or streams. The policy manager determines if a stream is allowed to run or resume based on information it receives from resource management, system configuration, requests from applications, or other factors.
####2.1.17.2 Architectural Assumptions The following discussion makes two architectural assumptions about the OpenMAX IL:
- Assumption 1: A framework exists that contains a policy manager between the applications and the OpenMAX IL.
- Assumption 2: A system can have one or more hardware platforms that are used by different OpenMAX components and that are managed by hardware vendor-specific resource manager(s).
These assumptions are illustrated in the high-level architecture shown in Figure 2-15. For systems that do not have a framework (that is, where user applications interface directly with the IL), version 1.0 of the OpenMAX IL API specification does not specify how resource management will be handled. Assumption 2 covers systems that have a single, centralized resource manager as well.
Figure 2-15. Architectural Assumptions To ensure consistent component behavior in case of resource conflicts, a common definition of component priority and a set of behavioral rules are needed.
####2.1.17.3 Component Priorities
Each IL component has a priority value (an OMX_U32 integer) that the IL client sets. The actual range of priorities can be left up to the platform, but the priority order is important and needs to be the same across IL implementations. A descending order of priority is chosen with 0 denoting the highest priority. The following tie-breaking rule also applies: When comparing components with the same priority, components that have acquired the resource most recently should be deemed to be of higher priority than components that have had the resource longer.
####2.1.17.4 Behavioral Rules The following behavior is defined on the IL layer:
- The
OMX_ErrorInsufficientResources
error is called only on a component that attempts to go to the idle state when there are insufficient resources and sufficientresources cannot be freed by preempting lower priority components. - A component is not aware that preemption is occurring when it tries to go to the idle state, and the resources it requires need to be freed by preempting lower priority components.
- When a component that already has resources needs to be preempted, it will send the
OMX_ErrorResourcesPreempted
andOMX_ErrorResourcesLost
errors to the IL client as it moves from the Executing or Paused state to the Idle state and from the Idle state to the Loaded state, respectively. - In cases where the IL client wants to know when the stream associated with the component can be resumed or started, the IL client shall request to be notified when resources are available. This occurs by putting the component into the OMX_StateWaitForResources state. When the resources become available, the component automatically goes to the idle state. When the client receives the notification that the component is in the idle state, it can try to move the rest of the components in that chain to the idle state as well. This automatic movement to the idle state ensures that in cases where multiple IL clients are waiting for the same resource, the IL client can resume or start the stream as soon as the resource is available. If the component were to automatically move just to the loaded state, then another IL client could grab that resource first. These behavioral rules are intended to cover only the interactions between the IL client(s) and the IL components.
####2.1.17.5 Hardware Vendor-Specific Resource Manager To implement the behavioral rules, a hardware vendor-specific resource manager will need to exist below the IL layer and perform the following functions:
- Implement and manage the wait queue(s).
- Keep track of available resources.
- Keep track of each component that has resources and which resources they are using.
- Notify a component or multiple components that they need to give up their resources when a higher priority component requests the resource.
- Notify the highest priority component waiting for a resource when the resource is available.
The actual interactions between the components and the hardware vendor-specific resource manager(s) are vendor-specific and outside the scope of this document. Section 3 provides more details of the parameter structures and use cases related to priority and resource management.