add gptfast decoder #11

Sanster · 2024-06-04T05:48:01Z

Add decoder with static kv-cache from gpt-fast. Manually checked the results of the images in dataset/mini_pubtabnet/val, but have not actually run the acc/TEDS metrics on the test set.

Benchmark cell detection model with:

3090
max_decode_len = 512
fp32

The main modifications in full_pipeline.ipynb.

Specify which class to use for the decoder:

backbone = ImgLinearBackbone(d_model=d_model, patch_size=patch_size)
encoder = Encoder(
    d_model=d_model,
    nhead=nhead,
    dropout = dropout,
    activation="gelu",
    norm_first=True,
    nlayer=12,
    ff_ratio=4,
)
# decoder_class = Decoder
decoder_class = GPTFastDecoder

Initialize the decoder in load_vocab_and_model, call map_state_dict when using GPTFastDecoder.

def load_vocab_and_model(..., decoder_class: Type[nn.Module]):
    decoder = decoder_class(
         d_model=d_model,
         nhead=nhead,
         dropout = dropout,
         activation="gelu",
         norm_first=True,
         nlayer=4,
         ff_ratio=4,   
    )
    model = EncoderDecoder(
        backbone=backbone,
        encoder=encoder,
        decoder=decoder,
        ...
    )

    state_dict = torch.load(model_weights, map_location="cpu")
    if isinstance(model.decoder, GPTFastDecoder):
        state_dict = map_state_dict(state_dict)
    
    model.load_state_dict(state_dict)
    model = model.to(device)
    return vocab, model

In autoregressive_decode, if GPTFastDecoder is used, setup_caches needs to be called first.

def autoregressive_decode(...):
    model.eval()
    is_gpt_fast = isinstance(model.decoder, GPTFastDecoder)
    if is_gpt_fast:
        with torch.device(image.device):
            model.decoder.setup_caches(max_batch_size=image.shape[0], max_seq_length=max_decode_len, dtype=image.dtype)
    memory = model.encode(image)
    ...

add gptfast decoder

893c8c4

Sanster mentioned this pull request Jun 4, 2024

dataset Annotation #6

Open

Sanster added 2 commits June 4, 2024 17:15

fix clean cross_attention kv cache

f830d85

consider batch_size when update kv cache

228421d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add gptfast decoder #11

add gptfast decoder #11

Sanster commented Jun 4, 2024 •

edited

Loading

add gptfast decoder #11

Are you sure you want to change the base?

add gptfast decoder #11

Conversation

Sanster commented Jun 4, 2024 • edited Loading

Sanster commented Jun 4, 2024 •

edited

Loading