Organizations

@Dcard

Starred repositories

Approximate Nearest Neighbor Search for Sparse Data in Python!

Python 919 145 Updated Oct 2, 2020

A game theoretic approach to explain the output of any machine learning model.

Jupyter Notebook 23,301 3,325 Updated Jan 28, 2025

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

27,633 3,721 Updated Jul 18, 2024

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 21,172 2,733 Updated Aug 15, 2024

Text preprocessing, representation and visualization from zero to hero.

Python 2,895 239 Updated Aug 29, 2023

Path to a Software Architect

8,738 778 Updated Jun 1, 2023

A set of standard document templates.

2,021 164 Updated Oct 2, 2023

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 287,325 47,883 Updated Dec 2, 2024

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Python 6,183 1,177 Updated May 28, 2023

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

Python 3,259 570 Updated Apr 14, 2023

Simple Python version management

Roff 40,465 3,094 Updated Jan 19, 2025

a pyenv plugin to manage virtualenv (a.k.a. python-virtualenv)

Shell 6,446 410 Updated Jan 1, 2025

Line bot that checks if a message contains internet rumor.

TypeScript 78 17 Updated Jan 27, 2025

GraphQL API server for clients like rumors-site and rumors-line-bot

JavaScript 115 28 Updated Jan 20, 2025

High level Python client for Elasticsearch

Python 3,850 804 Updated Jan 8, 2025

This repository stores slides for a tutorial on variational inference for NLP audiences.

TeX 298 54 Updated Jul 22, 2019

A TensorFlow Implementation of the Transformer: Attention Is All You Need

Python 4,319 1,304 Updated May 21, 2023

BERT with SentencePiece for Japanese text.

Jupyter Notebook 497 93 Updated Feb 15, 2021

TensorFlow code and pre-trained models for BERT

Python 38,572 9,653 Updated Jul 23, 2024

🤔 Search & Replace unicode emojis. Supports Unicode 10

Python 19 5 Updated Oct 10, 2017

TensorFlow tutorials and best practices.

8,621 905 Updated Oct 22, 2020

Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings

C 6,938 1,524 Updated Nov 23, 2024

Pre-trained word vectors of 30+ languages

Python 2,216 392 Updated Oct 11, 2018

TensorFlow implementation of contextualized word representations from bi-directional language models

Python 1,619 452 Updated Mar 4, 2023

A collection of Scala best practices

4,391 624 Updated Nov 9, 2022

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,512 1,188 Updated Dec 1, 2024

a simple Scala CLI parsing library

Scala 681 57 Updated Dec 7, 2024

Solutions to LeetCode problems; updated daily. Subscribe to my YouTube channel for more.

Java 3,871 1,290 Updated Jan 26, 2025

Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)

5,827 970 Updated Feb 15, 2023

PyTorch NLP library based on FastAI

Python 283 49 Updated Jul 4, 2018