Reduce T5 model size and enhance performance #6

@GabrielePicco

Description

Scenario summary

Inference with T5 models is currently slow.
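
To make "slow" measurable before and after any optimization, a baseline latency could be recorded first. The following is a minimal sketch using only the Python standard library; `measure_latency` and `run_inference` are hypothetical names introduced for illustration, and `run_inference` is a placeholder to be replaced with the project's actual T5 inference call.

```python
import statistics
import time
from typing import Callable, Dict


def measure_latency(
    fn: Callable[[], object], warmup: int = 2, runs: int = 10
) -> Dict[str, float]:
    """Time repeated calls to `fn` and return summary statistics in seconds."""
    if runs < 1:
        raise ValueError("runs must be at least 1")
    # Warm-up calls are executed but not timed (caches, lazy initialisation).
    for _ in range(warmup):
        fn()
    timings = []
    for _ in range(runs):
        start = time.perf_counter()
        fn()
        timings.append(time.perf_counter() - start)
    return {
        "median_s": statistics.median(timings),
        "min_s": min(timings),
        "max_s": max(timings),
    }


def run_inference() -> None:
    # Hypothetical placeholder: swap in the real T5 inference call here.
    time.sleep(0.001)


if __name__ == "__main__":
    print(measure_latency(run_inference))
```

Reporting the median rather than the mean keeps a single slow run (for example, a first call that loads weights) from dominating the result.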

Proposed solution

Investigate and implement a solution to reduce model size and speed up inference. Some ideas to consider:

Labels: enhancement (New feature or request)