Paul Jason Mello | Writings

Writings

Reports, paper reviews, thoughts, and pre-prints.

Pre-Print Jun 14, 2026

A Comparison of HGSADC vs ACS on the Site-Dependent Vehicle Routing Problem

Pre-print comparing HGSADC and ant colony metaheuristics on the site-dependent vehicle routing problem.

Read Report → View Code →
Pre-Print Jun 1, 2026

Diffusion Denoised Pruning

Pre-print on diffusion inspired pruning for layer-balanced subnetworks.

Read Report →
Research Nov 1, 2025

Echo State Networks

Research on echo state networks and reservoir computing.

Read Report → View Code →
Research Jun 2, 2025

Liquid Time-Constant Networks

Research on continuous-time recurrent networks with adaptive dynamics.

Read Report → View Code →
Research May 19, 2025

Sheaf Neural Networks

Research on sheaf-based graph neural networks.

Read Report → View Code →
Thoughts May 16, 2025

Tweedie's Formula

Notes on Tweedie's formula and the score-matching view of denoising.

Read Report →
Pre-Print May 5, 2025

Perturbation Driven Generalization

Pre-print on data augmentation strategies and model generalization.

Read Report → View Code →
Paper Review Oct 22, 2024

Learning to Compress: Local Rank and Information Compression in Deep Neural Networks

Review of local rank as a measure of feature manifold dimensionality.

Read Report →
Paper Review Oct 14, 2024

An Image is Worth 16x16 Words: Transformers For Image Recognition at Scale

Review of the Vision Transformer for image classification at scale.

Read Report →
Paper Review Oct 12, 2024

Training Compute-Optimal Large Language Models (Chinchilla)

Review of Chinchilla compute-optimal scaling laws for LLMs.

Read Report →
Paper Review Sep 28, 2024

Subword Regularization: Improving Neural Translation Models with Multiple Subword Candidates

Review of subword sampling as a regularizer for neural translation.

Read Report →
Paper Review Sep 22, 2024

Universal Language Model FineTuning for Text Classification

Review of ULMFiT transfer learning for text classification.

Read Report →
Paper Review Sep 13, 2024

Music Transformer Generating Music With Long-Term Structure

Review of relative attention for long-term music generation.

Read Report →
Paper Review Sep 6, 2024

Sequence to Sequence Learning with Neural Networks

Review of encoder-decoder LSTMs for sequence-to-sequence learning.

Read Report →
Paper Review Aug 31, 2024

Enriching Word Vectors with Subword Information

Review of subword n-grams enriching word embeddings (fastText).

Read Report →

Writings

A Comparison of HGSADC vs ACS on the Site-Dependent Vehicle Routing Problem

Diffusion Denoised Pruning

Echo State Networks

Liquid Time-Constant Networks

Sheaf Neural Networks

Tweedie's Formula

Perturbation Driven Generalization

Learning to Compress: Local Rank and Information Compression in Deep Neural Networks

An Image is Worth 16x16 Words: Transformers For Image Recognition at Scale

Training Compute-Optimal Large Language Models (Chinchilla)

Subword Regularization: Improving Neural Translation Models with Multiple Subword Candidates

Universal Language Model FineTuning for Text Classification

Music Transformer Generating Music With Long-Term Structure

Sequence to Sequence Learning with Neural Networks

Enriching Word Vectors with Subword Information