Build changes
- Add FdFinishedFunc util to mediapipe
- rename a config setting to BUILD_FOR_OSS
- #mediapipe #ios remove custom cpp version (rely on the common cpp version set at build time)
- Rely on the common cpp version set at build time.
Framework and core calculator improvements
- Update C++ Graph Builder to support source layers.
- Bump MP version for release 0.10.23.
- Add Back-Edge support in Graph builder.
- Add a destructor to WebGpuAsyncFuture that correctly frees any pending future.
- Add tools for logging Tensors, ImageFrames and cv::Mats
- Add a utility for creating a view of a Tensor into an OpenCV Mat
- Add WebGpuCreateRenderPipelineAsync utility.
- [mediapipe] update documentation mentioning python versions
- Bump MP version for release 0.10.24.
- Add support for
GemmaV2-2B
via XNNPACK. - Remove obsolete checks that integer division rounds to zero.
- Inline SafeIntStrongIntValidator::SanityCheck function
- Debug logging: Fix and properly support logging RGBA images
- Fix modules/face_detection documentation.
- Add LogHalideBuffer variant for logging Halide buffers
- Add support for
GemmaV3-1B
models using XNNPACK. - Correct documentation to reflect actual behavior.
- Fix KleidiAI repository URL.
- Removed usage of deprecated InitFromGraphWithTransforms.
- Dynamically quantize inputs only once before projecting to queries, keys, and values.
- add an enum option to spectrogram calculator to output frames with all channels instead of vector of matrices
- Fix GlBufferView (bug: incomplete move constructor)
- Don't recreate write views on the same internal-only-use tensor (which triggers error messages) and fix read/write view usages.
- Support loading PackWeightsCache from a file descriptor
- Update flag description to use correct name for input_token_limit
- Reduce logging frequency for some warnings.
- Allow header output for all resampling strategies.
- Fix failing build:
blaze --blazerc=/dev/null build //third_party/mediapipe/examples/ios/facedetectioncpu:FaceDetectionCpuApp.apple_binary --config=ios_arm64 --ios_minimum_os=12.0
- Add std::vector output support to ConstantSidePacketCalculator
- Avoid creating unused StatusRep objects on each CalculatorNode::ProcessNode call
- Avoid creating multiple status reps on each mediapipe::tool::StatusStop() call
- Add option to process timestamp bound for ImmediateMuxCalculator.
MediaPipe Tasks update
This section should highlight the changes that are done specifically for any platform and don't propagate to
other platforms.
Android
- Move the callback registration into the InferenceSession.
- Add
updateSessionConfig
getSetencePieceProcessor
API to Java interface. - Add getSessionOptions method to LlmInferenceSession.
- This enables cloning for OpenCL-backed inference sessions
- Adding support for prompt templates
- Adds support to cancel async generation.
- Expose the max number of image to process to unlock vision for multi-modal processing
- Remove unnecessary chunk for add image API
- Declare the dependency of the OpenCL libraries, so that clients don't have to.
iOS
- Add vision modality support in swift API.
- Moving skia conversion to LLM c lib.
Javascript
- Remove artificial limits on maxBufferSize and maxStorageBufferBindingSize for LLM Inference on web.
- Use different parameters (topk, temperature) for gemma3
- Web LLM Inference: better error messaging for re-entry occurring from callback
- Add toggle for allowing the forcing of float32 precision for LLM Inference on web
Python
- Create a Packet containing a vector of ImageFrames. Get a list of ImageFrames from a Packet.
- Remove unused parameter from a docstring.
- Avoid unnecessary copy of ImageFrames.
- Add extra settings (disallowing service default initialization) for the base solution and allow setting it from pose solution.
- No public description
- Create a script that runs the AI Edge Converter for all models in models.json
- Support bundling additional .tflite models in .task
- Enabling LoRA for Gemma3 conversions
- Update llm bundler to put vision in .task
MediaPipe Dependencies
- Update WASM files for 0.10.22 release