PatentBrief

Systems and methods for attention-based configurable convolutional neural networks (ABC-CNN) for visual question answering

Described herein are systems and methods for generating and using attention-based deep learning architectures for visual question answering task (VQA) to automatically generate answers for image-related (still or video i…

Granted 2018activeExpires 2036Owned by Baidu USA LLCInvented by Kan Chen, Jiang Wang, Wei Xu

Original patent title: “Systems and methods for attention-based configurable convolutional neural networks (ABC-CNN) for visual question answering

What this patent covers

The actual claim

Described herein are systems and methods for generating and using attention-based deep learning architectures for visual question answering task (VQA) to automatically generate answers for image-related (still or video images) questions. To generate the correct answers, it is important for a model's attention to focus on the relevant regions of an image according to the question because different questions may ask about the attributes of different image regions. In embodiments, such question-guided attention is learned with a configurable convolutional neural network (ABC-CNN). Embodiments of the ABC-CNN models determine the attention maps by convolving image feature map with the configurable convolutional kernels determined by the questions semantics. In embodiments, the question-guided attention maps focus on the question-related regions and filters out noise in the unrelated regions.

What this patent does NOT cover

The boundaries

    These exclusions are unique to PatentBrief — derived from the actual claim language, not patent-office boilerplate.

    Systems and methods for attent…(Primary claim)

    Schematic visualization of the patent's claim structure. Hand-drawn diagrams in progress for each landmark patent.

    Patent Abstract

    Patent abstract

    Described herein are systems and methods for generating and using attention-based deep learning architectures for visual question answering task (VQA) to automatically generate answers for image-related (still or video images) questions. To generate the correct answers, it is important for a model's attention to focus on the relevant regions of an image according to the question because different questions may ask about the attributes of different image regions. In embodiments, such question-guided attention is learned with a configurable convolutional neural network (ABC-CNN). Embodiments of the ABC-CNN models determine the attention maps by convolving image feature map with the configurable convolutional kernels determined by the questions semantics. In embodiments, the question-guided attention maps focus on the question-related regions and filters out noise in the unrelated regions.

    Patent Journey

    From filing to today

    Patent Filed

    2016

    Patent Granted

    2018 · 2yr after filing

    Active Today

    2026

    Expires

    2036

    PatentBrief Score

    Impact Score

    54/ 100

    Moderate

    Citation count

    29/40

    Moderately cited

    Claim breadth

    15/20

    Broad claims

    Recency

    10/20

    Granted 5–10 years ago

    Assignee scale

    0/20

    Independent or smaller assignee

    PatentBrief Impact Score — based on citation count, claim breadth, recency, and assignee scale. Not a legal assessment.

    The original legal language

    Original claims

    23 claims as filed with the patent office.

    Citations

    Patent lineage

    Cites earlier patents

    2

    earlier patents this invention cites as foundations

    View prior art →

    Cited by later patents

    28

    later patents that build on this invention

    View patents →

    Stay in the loop

    Get a weekly digest of new patents.

    One email per week. No spam. Unsubscribe anytime.

    Keep exploring

    Related patents you should know

    US 12564871 · 2026

    A Fixture for Cleaning Showerheads with Multiple Separate Chambers

    This patent describes a cleaning device for showerheads that uses a fixture with three or more separate internal compartments and channels to direct cleaning fluid to the showerhead's upper surfaces.

    ASM IP HOLDING BV

    US 12324579 · 2025

    Surgical Stapler Battery Health Check During Operation

    This patent describes a powered surgical stapler that can detect if some of its rechargeable battery cells are damaged while it's actually firing staples, helping ensure the procedure finishes safely.

    CILAG GMBH INT

    US 12471982 · 2025

    Surgical Tool That Combines Energy Treatment and Stapling

    CILAG's patent details a surgical instrument that applies therapeutic energy to tissue, monitors its properties, then deploys staples, adapting the stapling based on the initial energy treatment and monitoring.

    CILAG GMBH INT

    US 11918209 · 2024

    Real-Time Surgical Instrument Status on Live Video During Operations

    This patent describes a surgical system that shows live video from inside the body and overlays important information about the surgical tool directly onto the screen, helping surgeons operate more precisely.

    CILAG GMBH INT

    US 8697359 · 2014

    How to Use CRISPR-Cas9 to Edit Genes in Human Cells

    This patent describes a method and system for precisely altering gene expression in eukaryotic cells, including human cells, using an engineered CRISPR-Cas9 system that targets and cleaves specific DNA sequences.

    Massachusetts Institute of Technology

    US 4683195 · 1987

    How to Make Many Copies of a Specific DNA Segment

    This patent describes the Polymerase Chain Reaction (PCR), a fundamental process for making millions of copies of a specific DNA or RNA segment from a tiny sample, enabling its detection.

    Cetus Corp

    Patent monitoring

    Get notified when Baidu USA LLC files a new patent

    Get notified when this company files a new patent. Weekly digest · Confirm via email · Unsubscribe anytime.

    Last reviewed: · PatentBrief is not a law firm and this is not legal advice.