remove-lm-mask #175

david-thrower · 2025-04-12T03:44:12Z

It appears a phishing_email_detection_gpt2.py line 407 needs the zero mask set to False. This is an AI - introduced error.

embedded = tf.keras.layers.Embedding(
    input_dim=VOCABULARY_SIZE,
    output_dim=EMBEDDING_DIM,
    input_length=max_seq_length,
    mask_zero=True)(tokens)

The text was updated successfully, but these errors were encountered:

david-thrower · 2025-04-13T03:34:50Z

Do not merge

Counterintuitively, even though the mask is not being passed to the embedding from the tokenizer, setting the parameter zero_mask to False on the Embedding layer appears deleterious to the NLP model's performance, evidence by these trials (https://github.com/david-thrower/cerebros-core-algorithm-alpha/actions/runs/14415992085/job/40432514634):

Trial #     val_binary_accuracy
Trial 1:    0.9499 
Trial 2:    0.9387

Where at least 0.952 was expected, See https://github.com/david-thrower/cerebros-core-algorithm-alpha/actions/runs/14415528912/job/40431381425 (0.97 ) and https://github.com/david-thrower/cerebros-core-algorithm-alpha/releases/tag/v0.10.0-alpha. This change should be tested further, just to see if this face value appearance of performance degradation was spurious, which it may be with a sample size of 2. For now, it appears to be unwise to merged this until there is evidence to discredit this possibility.

david-thrower added triage/high-priority audience/technical Issue primarily for technical review and service. kind/performance labels Apr 12, 2025

david-thrower self-assigned this Apr 12, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

remove-lm-mask #175

remove-lm-mask #175

david-thrower commented Apr 12, 2025

david-thrower commented Apr 13, 2025

Uh oh!

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

remove-lm-mask #175

remove-lm-mask #175

Comments

david-thrower commented Apr 12, 2025

david-thrower commented Apr 13, 2025

Do not merge

Uh oh!

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.