Misc.

Something I want to share:

  1. What should we expect on AI developments? - The rapid evolution from research to product deployment

  2. My understanding and experience with Audio-based Large Language Models - Bridging speech and language understanding

  3. AI has become a product, what's next for academia and science? - Balancing fundamental research with practical applications

  4. What are the pros and cons for working and living in Singapore? - A multicultural hub for AI research and innovation

Readings (I record some of my readings here.)

  1. 2025-02: A post on “Always bet on text” from Graydon Hoare. The original Post from 2014.

  2. 2025-02: 7 Different Convolutions for designing CNNs that will Level-up your Computer Vision project. Link.

  3. 2025-02: Conformer: Convolution-augmented Transformer for Speech Recognition

  4. 2025-02: SEAVL: TBD

  5. 2025-02: SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

  6. 2025-01: An introduction to ASR and TTS? Book material.

  7. 2025-01: How does Mix-of-Experts work? A good tutorial blog in 2023.