Skip to content
View stefan-it's full-sized avatar
🤓
hacking 🎧
🤓
hacking 🎧

Organizations

@flairNLP @Hugging-Face-Supporter @GermanT5 @Hugging-Face-Helping-Hand @LEL-A
Block or Report

Block or report stefan-it

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
13 stars written in C++
Clear filter

A toolkit for making real world machine learning and data analysis applications in C++

C++ 13,082 3,321 Updated May 12, 2024

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 9,592 1,118 Updated May 10, 2024

MITIE: library and tools for information extraction

C++ 2,905 536 Updated Sep 1, 2022

General purpose unsupervised sentence representations

C++ 1,189 256 Updated Aug 3, 2022

A lightweight header-only library for using Keras (TensorFlow) models in C++.

C++ 1,048 235 Updated May 16, 2024

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,013 128 Updated Apr 23, 2024

Encodes a file into a video format to store on a cloud video hosting service

C++ 881 63 Updated Nov 16, 2023

Quantized word vectors that take 8x-16x less space than regular word vectors

C++ 755 36 Updated Mar 31, 2020

Fast Deep Learning Library (DLL) for C++ (ANNs, CNNs, RBMs, DBNs...)

C++ 666 161 Updated Feb 9, 2024

Fast Block Sparse Matrices for Pytorch

C++ 543 35 Updated Jan 21, 2021

A word2vec negative sampling implementation with correct CBOW update.

C++ 261 18 Updated Nov 8, 2021

terashuf shuffles multi-terabyte text files using limited memory

C++ 197 15 Updated Feb 5, 2023

Code for EMNLP2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training"

C++ 19 2 Updated Nov 12, 2021