Object recognition

Applied Filters

People

Publications

Publication Date

Searched The ACM Guide to Computing Literature (3,777,513 records)|Limit your search to The ACM Full-Text Collection (762,665 records)

Showing 1 - 20of330 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
April 2024
Fully Sparse Fusion for 3D Object Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 46, Issue 11Pages 7217–7231https://doi.org/10.1109/TPAMI.2024.3392303
Currently prevalent multi-modal 3D detection methods rely on dense detectors that usually use dense Bird’s-Eye-View (BEV) feature maps. However, the cost of such BEV feature maps is quadratic to the detection range, making it not scalable for long-...
0
Metrics
Total Citations0
research-article
April 2024
VST++: Efficient and Stronger Visual Saliency Transformer
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 46, Issue 11Pages 7300–7316https://doi.org/10.1109/TPAMI.2024.3388153
While previous CNN-based models have exhibited promising results for salient object detection (SOD), their ability to explore global long-range dependencies is restricted. Our previous work, the Visual Saliency Transformer (VST), addressed this constraint ...
0
Metrics
Total Citations0
research-article
April 2024
Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning
- Sijin Chen,
- Hongyuan Zhu,
- Mingsheng Li,
- Xin Chen,
- Peng Guo,
- Yinjie Lei,
- Gang Yu,
- Taihao Li,
- Tao Chen
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 46, Issue 11Pages 7331–7347https://doi.org/10.1109/TPAMI.2024.3387838
3D dense captioning requires a model to translate its understanding of an input 3D scene into several captions associated with different object regions. Existing methods adopt a sophisticated “detect-then-describe” pipeline, which builds ...
0
Metrics
Total Citations0
research-article
April 2024
Representing Noisy Image Without Denoising
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 46, Issue 10Pages 6713–6730https://doi.org/10.1109/TPAMI.2024.3386985
A long-standing topic in artificial intelligence is the effective recognition of patterns from noisy images. In this regard, the recent data-driven paradigm considers 1) improving the representation robustness by adding noisy samples in training phase (...
0
Metrics
Total Citations0
research-article
April 2024
PPDM++: Parallel Point Detection and Matching for Fast and Accurate HOI Detection
- Yue Liao,
- Si Liu,
- Yulu Gao,
- Aixi Zhang,
- Zhimin Li,
- Fei Wang,
- Bo Li
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 46, Issue 10Pages 6826–6841https://doi.org/10.1109/TPAMI.2024.3386891
Human-Object Interaction (HOI) detection aims to understand human activities by detecting interaction triplets. Previous HOI detection methods adopt a two-stage instance-driven paradigm. Unfortunately, many non-interactive human-object pairs generated by ...
0
Metrics
Total Citations0
research-article
Open Access
March 2024
<italic>FeatAug-DETR:</italic> Enriching One-to-Many Matching for DETRs With Feature Augmentation
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 46, Issue 9Pages 6402–6415https://doi.org/10.1109/TPAMI.2024.3381961
One-to-one matching is a crucial design in DETR-like object detection frameworks. It enables the DETR to perform end-to-end detection. However, it also faces challenges of lacking positive sample supervision and slow convergence speed. Several recent ...
0
Metrics
Total Citations0
research-article
March 2024
Gradient-Based Instance-Specific Visual Explanations for Object Specification and Object Discrimination
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 46, Issue 9Pages 5967–5985https://doi.org/10.1109/TPAMI.2024.3380604
We propose the gradient-weighted Object Detector Activation Maps (ODAM), a visual explanation technique for interpreting the predictions of object detectors. Utilizing the gradients of detector targets flowing into the intermediate feature maps, ODAM ...
0
Metrics
Total Citations0
research-article
March 2024
Turning a CLIP Model Into a Scene Text Spotter
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 46, Issue 9Pages 6040–6054https://doi.org/10.1109/TPAMI.2024.3379828
We exploit the potential of the large-scale Contrastive Language-Image Pretraining (CLIP) model to enhance scene text detection and spotting tasks, transforming it into a robust backbone, FastTCM-CR50. This backbone utilizes visual prompt learning and ...
0
Metrics
Total Citations0
research-article
March 2024
On Boundary Discontinuity in Angle Regression Based Arbitrary Oriented Object Detection
- Yi Yu,
- Feipeng Da
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 46, Issue 10Pages 6494–6508https://doi.org/10.1109/TPAMI.2024.3378777
With vigorous development e.g., in autonomous driving and remote sensing, oriented object detection has gradually been featured. The majority of existing methods directly perform regression on the rotation angle, which we argue has fundamental limitations ...
0
Metrics
Total Citations0
research-article
November 2023
Unified Adversarial Patch for Visible-Infrared Cross-Modal Attacks in the Physical World
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 46, Issue 4Pages 2348–2363https://doi.org/10.1109/TPAMI.2023.3330769
Physical adversarial attacks have put a severe threat to DNN-based object detectors. To enhance security, a combination of visible and infrared sensors is deployed in various scenarios, which has proven effective in disabling existing single-modal ...
0
Metrics
Total Citations0
research-article
September 2023
Mutual-Assistance Learning for Object Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 45, Issue 12Pages 15171–15184https://doi.org/10.1109/TPAMI.2023.3319634
Object detection is a fundamental yet challenging task in computer vision. Despite the great strides made over recent years, modern detectors may still produce unsatisfactory performance due to certain factors, such as non-universal object features and ...
6
Metrics
Total Citations6
research-article
September 2023
Attribute-Guided Collaborative Learning for Partial Person Re-Identification
- Haoyu Zhang,
- Meng Liu,
- Yuhong Li,
- Ming Yan,
- Zan Gao,
- Xiaojun Chang,
- Liqiang Nie
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 45, Issue 12Pages 14144–14160https://doi.org/10.1109/TPAMI.2023.3312302
Partial person re-identification (ReID) aims to solve the problem of image spatial misalignment due to occlusions or out-of-views. Despite significant progress through the introduction of additional information, such as human pose landmarks, mask maps, ...
5
Metrics
Total Citations5
research-article
August 2023
QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple Object Tracking
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 45, Issue 12Pages 15380–15393https://doi.org/10.1109/TPAMI.2023.3301975
Similarity learning has been recognized as a crucial step for object tracking. However, existing multiple object tracking methods only use sparse ground truth matching as the training objective, while ignoring the majority of the informative regions in ...
2
Metrics
Total Citations2
research-article
July 2023
Multiscale Dynamic Graph Representation for Biometric Recognition With Occlusions
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 45, Issue 12Pages 15120–15136https://doi.org/10.1109/TPAMI.2023.3298836
Occlusion is a common problem with biometric recognition in the wild. The generalization ability of CNNs greatly decreases due to the adverse effects of various occlusions. To this end, we propose a novel unified framework integrating the merits of both ...
0
Metrics
Total Citations0
research-article
October 2022
End2End Occluded Face Recognition by Masking Corrupted Features
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 44, Issue 10_Part_2Pages 6939–6952https://doi.org/10.1109/TPAMI.2021.3098962
With the recent advancement of deep convolutional neural networks, significant progress has been made in general face recognition. However, the state-of-the-art general face recognition models do not generalize well to occluded face images, which are ...
12
Metrics
Total Citations12
research-article
October 2022
Fast and Robust Multi-Person 3D Pose Estimation and Tracking From Multiple Views
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 44, Issue 10_Part_2Pages 6981–6992https://doi.org/10.1109/TPAMI.2021.3098052
This paper addresses the problem of reconstructing 3D poses of multiple people from a few calibrated camera views. The main challenge of this problem is to find the cross-view correspondences among noisy and incomplete 2D pose predictions. Most previous ...
11
Metrics
Total Citations11
research-article
October 2022
Joint Detection and Matching of Feature Points in Multimodal Images
- Elad Ben Baruch,
- Yosi Keller
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 44, Issue 10_Part_1Pages 6585–6593https://doi.org/10.1109/TPAMI.2021.3092289
In this work, we propose a novel Convolutional Neural Network (CNN) architecture for the joint detection and matching of feature points in images acquired by different sensors using a single forward pass. The resulting feature detector is tightly coupled ...
2
Metrics
Total Citations2
research-article
October 2022
Segment as Points for Efficient and Effective Online Multi-Object Tracking and Segmentation
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 44, Issue 10_Part_1Pages 6424–6437https://doi.org/10.1109/TPAMI.2021.3087898
Current multi-object tracking and segmentation (MOTS) methods follow the tracking-by-detection paradigm and adopt 2D or 3D convolutions to extract instance embeddings for instance association. However, due to the large receptive field of deep ...
1
Metrics
Total Citations1
research-article
October 2022
Concealed Object Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 44, Issue 10_Part_1Pages 6024–6042https://doi.org/10.1109/TPAMI.2021.3085766
We present the first systematic study on concealed object detection (COD), which aims to identify objects that are visually embedded in their background. The high intrinsic similarities between the concealed objects and their background make COD far more ...
49
Metrics
Total Citations49
research-article
September 2022
Bayesian Embeddings for Few-Shot Open World Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 46, Issue 3Pages 1513–1529https://doi.org/10.1109/TPAMI.2022.3201541
As autonomous decision-making agents move from narrow operating environments to unstructured worlds, learning systems must move from a closed-world formulation to an open-world and few-shot setting in which agents continuously learn new classes from small ...
0
Metrics
Total Citations0

Applied Filters

People

Names

Institutions

Authors

Reviewers

Publications

All Publications

Content Type

Publisher

Publication Date

Fully Sparse Fusion for 3D Object Detection

VST++: Efficient and Stronger Visual Saliency Transformer

Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning

Representing Noisy Image Without Denoising

PPDM++: Parallel Point Detection and Matching for Fast and Accurate HOI Detection

<italic>FeatAug-DETR:</italic> Enriching One-to-Many Matching for DETRs With Feature Augmentation

Gradient-Based Instance-Specific Visual Explanations for Object Specification and Object Discrimination

Turning a CLIP Model Into a Scene Text Spotter

On Boundary Discontinuity in Angle Regression Based Arbitrary Oriented Object Detection

Unified Adversarial Patch for Visible-Infrared Cross-Modal Attacks in the Physical World

Mutual-Assistance Learning for Object Detection

Attribute-Guided Collaborative Learning for Partial Person Re-Identification

QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple Object Tracking

Multiscale Dynamic Graph Representation for Biometric Recognition With Occlusions

End2End Occluded Face Recognition by Masking Corrupted Features

Fast and Robust Multi-Person 3D Pose Estimation and Tracking From Multiple Views

Joint Detection and Matching of Feature Points in Multimodal Images

Segment as Points for Efficient and Effective Online Multi-Object Tracking and Segmentation

Concealed Object Detection

Bayesian Embeddings for Few-Shot Open World Recognition