Skip to content

Can we stop lowering FC to Matmul+BatchedAdd? #2145

Open
@bertmaher

Description

@bertmaher

GEMM is almost always implemented as Y = AX+b, so we actually end up doing something less efficient by separating these into two nodes. It's pretty universal on HW backends, too. We could write new optimizations to re-fuse them, but why?

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions