Weekly GitHub Report for Tensorflow: September 08, 2025 - September 15, 2025 (12:05:15)
Weekly GitHub Report for Tensorflow
Thank you for subscribing to our weekly newsletter! Each week, we deliver a comprehensive summary of your GitHub project's latest activity right to your inbox, including an overview of your project's issues, pull requests, contributors, and commit activity.
Table of Contents
I. News
1.1 Recent Version Releases:
The current version of this repository is v2.19.0
1.2 Version Information:
Released on March 5, 2025, TensorFlow version 2.19.0 introduces breaking changes to the tf.lite
API, including the deprecation of tf.lite.Interpreter
in favor of ai_edge_litert.interpreter
and changes to certain C++ constants for better API compatibility. Key updates also include runtime support for the bfloat16
data type in the tfl.Cast
operation and the discontinuation of standalone libtensorflow
package publishing, while still allowing unpacking from PyPI.
II. Issues
2.1 Top 5 Active Issues:
We consider active issues to be issues that that have been commented on most frequently within the last week. Bot comments are omitted.
-
[GPU][BiasAdd] Kernel launch count overflow with large Dense/bias_add (negative
work_element_count
→ abort): This issue reports a bug in TensorFlow 2.20.0 where the GPU kernel launch count overflows during aBiasAdd
operation on very large tensors, causing a fatal check failure due to a negativework_element_count
. The problem occurs with both Keras Dense layers using bias and rawtf.nn.bias_add
calls when usingbfloat16
data type on tensors with billions of elements, leading to crashes in eager execution and potentially under XLA compilation or SavedModel workflows.- A user confirmed reproducing the bug on Colab with TensorFlow 2.20.0 and GPU enabled, providing a reference notebook to demonstrate the issue.
- Number of comments this week: 1
-
TensorFlow/Keras model accumulates system and GPU RAM during training: This issue describes a problem where training a TensorFlow/Keras model causes both system RAM and GPU RAM usage to continuously increase, eventually exhausting memory and terminating the training process. The user is employing TensorFlow 2.19.0 with a custom data generator and a custom embedding loss function, and is unsure why the memory accumulation occurs during training.
- The single comment suggests trying alternative memory allocators such as jemalloc or TCmalloc by preloading them before running the training script, aiming to address potential memory management issues causing the RAM buildup.
- Number of comments this week: 1
-
Note and Clarify the Error Raised in
tf.raw_ops.ExtractGlimpse()
andtf.raw_ops.ExtractGlimpseV2()
: This issue addresses a documentation bug in TensorFlow'stf.raw_ops.ExtractGlimpse()
andtf.raw_ops.ExtractGlimpseV2()
functions, specifically clarifying the error message related to the simultaneous specification ofuniform_noise
andnoise
parameters. The current error description is misleading because it does not accurately reflect the condition under which the error is raised, and the issue requests that this clarification be added to the docstrings of both functions.- A pull request has been opened to fix the documentation and clarify the error message as discussed in the issue.
- Number of comments this week: 1
-
Clarify the Error Raised in
tf.train.CheckpointManager()
: This issue addresses a documentation bug in the TensorFlow API where the error raised bytf.train.CheckpointManager()
whencheckpoint_interval
is set without providing astep_counter
is not clearly documented. The reporter demonstrates that the current docstring lacks information about this required parameter, which leads to aValueError
during usage, and suggests that the documentation should explicitly mention this condition.- A pull request has been opened to address the documentation gap highlighted in the issue, aiming to clarify the error conditions related to the
step_counter
parameter intf.train.CheckpointManager()
. - Number of comments this week: 1
- A pull request has been opened to address the documentation gap highlighted in the issue, aiming to clarify the error conditions related to the
-
Clarify the range of action of
select_cols
intf.io.decode_csv()
andtf.raw_ops.DecodeCSV()
: This issue concerns a documentation bug in TensorFlow'stf.io.decode_csv()
andtf.raw_ops.DecodeCSV()
functions, specifically regarding the unclear behavior and range of theselect_cols
parameter. The reporter highlights thatselect_cols
appears to only apply to therecords
argument and not torecord_defaults
, which is not documented, causing confusion and aValueError
when their lengths do not match.- A pull request has been opened to address the documentation clarification, indicating that the issue is being actively worked on and the confusion around
select_cols
usage is being resolved. - Number of comments this week: 1
- A pull request has been opened to address the documentation clarification, indicating that the issue is being actively worked on and the confusion around
2.2 Top 5 Stale Issues:
We consider stale issues to be issues that has had no activity within the last 30 days. The team should work together to get these issues resolved and closed as soon as possible.
- TF-TRT Warning: Could not find TensorRT: This issue describes a problem where TensorFlow on Ubuntu 22.04 cannot detect TensorRT, resulting in a warning message "TF-TRT Warning: Could not find TensorRT" despite having compatible NVIDIA drivers and CUDA versions installed. The user reports difficulties with driver versions, specifically needing to downgrade from the default 550 driver to 535 for their RTX 3050 Ti GPU, and has tried reinstalling TensorFlow multiple times without success, seeking assistance to resolve the TensorRT detection failure.
SystemError
intf.ensure_shape
andtf.compat.v1.ensure_shape
whendtype
ofshape
istf.uint64
and its value is too large.: This issue reports a bug in TensorFlow where usingtf.ensure_shape
ortf.compat.v1.ensure_shape
with ashape
tensor of typetf.uint64
containing very large values close to 2^64 causes aSystemError
andOverflowError
. Specifically, when a shape like[18446743219011059112, 1]
is passed in eager execution mode, the APIs fail with aSystemError
related to the built-inisinstance
function, indicating improper handling of large unsigned 64-bit integers.- Feature Request: Integrate different Digital Signal Processing into tf.signal: This issue is a feature request proposing the integration of advanced digital signal processing (DSP) functionalities, similar to those found in the PyTorch julius library, into TensorFlow's tf.signal module. The requester highlights the benefits of having native audio data augmentation tools within TensorFlow to improve model robustness and streamline workflows by reducing reliance on external libraries.
- [DOCS] Missing complex input for Round op: This issue reports a documentation bug in TensorFlow where the
Round
operation is described as supporting complex tensor inputs, but in practice, attempting to use a complex tensor with this operation results in an error, requiring users to round the real and imaginary parts separately. The user provides a reproducible example and error logs showing that the operation fails due to a missing device kernel for complex inputs, highlighting a discrepancy between the official documentation and the actual behavior in TensorFlow version 2.15.0. - tf.raw_ops.Unbatch aborts with "Check failed: d < dims()": This issue reports a bug in TensorFlow version 2.17 where the operation
tf.raw_ops.Unbatch
aborts with a fatal check failure error "Check failed: d < dims()". The user has reproduced this problem on Linux Ubuntu 20.04.3 using Python 3.11.8 with custom code, and the error occurs due to an invalid dimension check during the unbatching process, causing the program to crash.
2.3 Open Issues
This section lists, groups, and then summarizes issues that were created within the last week in the repository.
Issues Opened This Week: 16
Summarized Issues:
- Null Pointer Dereference Bugs in TensorFlow Components: Multiple issues report potential null pointer dereference bugs in TensorFlow's XLA and MLIR components due to missing return statements or unchecked null variables, which can lead to crashes or undefined behavior during execution. These bugs highlight the need for better null checks and code safety in critical TensorFlow internals.
- issues/99907, issues/99909
- Crashes and Fatal Errors in TensorFlow 2.20.0: Several issues describe fatal errors including GPU kernel launch count overflow causing aborts, segmentation faults after PyVista's screenshot method, and memory exhaustion during model training, indicating stability problems in TensorFlow 2.20.0 and 2.19.0 under specific conditions. These problems affect both GPU operations and memory management, leading to crashes or termination of processes.
- issues/99894, issues/99925, issues/100085
- Symbol Export and Linking Conflicts in TensorFlow Shared Libraries: The shared library
libtensorflow_framework.so.2
exports unnecessary symbols from dependencies like LLVM, causing symbol resolution conflicts and potential crashes. The proposed solution involves building with-Bsymbolic
and using version scripts to restrict symbol exports and improve stability. - issues/99991
- Documentation Clarifications and Error Message Improvements: Multiple issues request clearer documentation and error messages for TensorFlow functions, including
ExtractGlimpse
,CheckpointManager
,Dequantize
,decode_csv
, andQuantizedMaxPool
. These clarifications aim to improve user understanding of parameter behaviors, error causes, and expected inputs to prevent confusion and misuse. - issues/100134, issues/100138, issues/100175, issues/100223, issues/100234
- Unexpected Behavior and Bugs in TensorFlow API Functions: Some TensorFlow API functions exhibit unexpected or incorrect behavior, such as
tf.nn.depthwise_conv2d
working with unsupported dilation and stride combinations, andtf.random.stateless_uniform
throwing exceptions with certain algorithm parameters. These bugs contradict official documentation or expected functionality. - issues/100237, issues/100252
- User Support and Diagnostic Requests for TensorFlow Lite on Android: An issue requests detailed system information, SDK versions, and logs to diagnose a problem with TensorFlow Lite in Google Play Services on Android devices, indicating ongoing support and troubleshooting efforts for mobile deployment.
- issues/100317
- Model Compilation and Metrics Handling Issues: A reported ValueError occurs when compiling a TensorFlow model with multiple outputs using a single-entry metrics list, highlighting the inconvenience of having to specify metrics separately for each output. This points to limitations in the current metrics API design.
- issues/100319
- Security Feature Implementation Discussion: There is a discussion about implementing stored S/MIME methods to enhance Gmail message security, indicating interest in integrating advanced security protocols.
- issues/100320
2.4 Closed Issues
This section lists, groups, and then summarizes issues that were closed within the last week in the repository. This section also links the associated pull requests if applicable.
Issues Closed This Week: 3
Summarized Issues:
- Build and Compatibility Errors: Several issues describe failures related to TensorFlow's compatibility with specific Python versions and system configurations. One issue highlights build failures due to SyntaxWarning errors from invalid escape sequences in Python 3.12 when compiling TensorFlow 2.10.1 with CUDA support, while another reports an ImportError caused by a DLL initialization failure in TensorFlow 2.19.0, preventing the native runtime from loading properly.
- [issues/99885, issues/100244]
- API Usage Errors: There is a reported bug involving incorrect usage of the
tensorflow.keras.layers.LSTM
layer in TensorFlow 2.20.0, where passing an invalid keyword argumentinputs=
instead of the expected input tensor leads to aTypeError
about a missing required argumentsequences
. This indicates strict argument requirements in the API that can cause runtime errors if not followed correctly. - [issues/100300]
2.5 Issue Discussion Insights
This section will analyze the tone and sentiment of discussions within this project's open and closed issues that occurred within the past week. It aims to identify potentially heated exchanges and to maintain a constructive project environment.
Based on our analysis, there are no instances of toxic discussions in the project's open or closed issues from the past week.
III. Pull Requests
3.1 Open Pull Requests
This section provides a summary of pull requests that were opened in the repository over the past week. The top three pull requests with the highest number of commits are highlighted as 'key' pull requests. Other pull requests are grouped based on similar characteristics for easier analysis. Up to 25 pull requests are displayed in this section, while any remaining pull requests beyond this limit are omitted for brevity.
Pull Requests Opened This Week: 9
Key Open Pull Requests
1. Note and Clarify the Error Raised in tf.raw_ops.ExtractGlimpse() and tf.raw_ops.ExtractGlimpseV2(): This pull request aims to improve the clarity and detail of the error messages raised by the TensorFlow operations tf.raw_ops.ExtractGlimpse()
and tf.raw_ops.ExtractGlimpseV2()
by updating their API definitions and source code documentation.
- URL: pull/100135
- Merged: No
2. Add AddressSanitizer (ASan) build support to Bazel config and macros, #99792: This pull request adds support for building TensorFlow with AddressSanitizer (ASan) using Bazel by introducing ASan-specific build flags, updating Bazel configuration files and macros to propagate these flags, and fixing related build and linker issues to enable detection of memory errors such as buffer overflows and use-after-free bugs.
- URL: pull/99900
- Merged: No
3. tf.math.argmax
support for int16 tensor in axis.: This pull request fixes a memory corruption issue in the tf.math.argmax
function when the axis parameter is provided as an int16 tensor by implementing safe casting to int32 and adding support for int, int32_t, and int64_t types, along with a test case to verify correct handling of int16 inputs.
- URL: pull/99904
- Merged: No
Other Open Pull Requests
- Nullptr Dereference Prevention: Multiple pull requests focus on preventing nullptr dereference errors in different components of TensorFlow. These include adding missing return statements and nullptr checks to enhance code safety and stability.
- pull/99908, pull/99910
- oneDNN 3D Pooling Core Dump Fixes: Several pull requests address core dump issues in oneDNN 3D pooling operations by handling corner cases and fixing related bugs. These fixes improve the robustness of maxpooling and AvgPooling3DGrad operations.
- pull/99921, pull/99922
- Documentation and Error Message Clarifications: Some pull requests update documentation and error messages to clarify parameter behavior and improve user understanding. This includes clarifying the select_cols parameter in CSV decoding functions and error messages in checkpoint management.
- pull/100235, pull/100140
3.2 Closed Pull Requests
This section provides a summary of pull requests that were closed in the repository over the past week. The top three pull requests with the highest number of commits are highlighted as 'key' pull requests. Other pull requests are grouped based on similar characteristics for easier analysis. Up to 25 pull requests are displayed in this section, while any remaining pull requests beyond this limit are omitted for brevity.
Pull Requests Closed This Week: 1
Key Closed Pull Requests
1. This is spam: This pull request, titled "This is spam," appears to be an unmerged contribution to the TensorFlow project containing generic commit messages labeled "Summary" and a merge from a forked branch, suggesting it was likely identified as irrelevant or spam.
- URL: pull/100266
- Merged: No
3.3 Pull Request Discussion Insights
This section will analyze the tone and sentiment of discussions within this project's open and closed pull requests that occurred within the past week. It aims to identify potentially heated exchanges and to maintain a constructive project environment.
Based on our analysis, there are no instances of toxic discussions in the project's open or closed pull requests from the past week.
IV. Contributors
4.1 Contributors
Active Contributors:
We consider an active contributor in this project to be any contributor who has made at least 1 commit, opened at least 1 issue, created at least 1 pull request, or made more than 2 comments in the last month.
If there are more than 10 active contributors, the list is truncated to the top 10 based on contribution metrics for better clarity.
Contributor | Commits | Pull Requests | Issues | Comments |
---|---|---|---|---|
ILCSFNO | 11 | 5 | 9 | 6 |
mihaimaruseac | 0 | 0 | 0 | 16 |
psamanoelton | 2 | 1 | 0 | 11 |
Venkat6871 | 1 | 0 | 0 | 11 |
zvoicu000 | 5 | 3 | 0 | 3 |
No author found | 10 | 0 | 0 | 0 |
adi1220 | 4 | 1 | 0 | 5 |
springcrane | 0 | 0 | 5 | 5 |
Aaraviitkgp | 6 | 0 | 0 | 3 |
aravindhbalaji1985 | 0 | 1 | 2 | 5 |