Stefan Schweter stefan-it

🤓

hacking 🎧

Researcher, M.Sc Computational Linguistics, Former student @ The Center for Information and Language Processing (CIS), LMU Munich

493 followers · 135 following

Near Munich, Germany
https://schweter.ml

Stars

occiglot / tech-report

Raw data, scripts, etc. to produce the tables and figures of our technical report

5 1 Updated May 2, 2024

kjslag / spacebyte

A byte-level decoder architecture that matches the performance of tokenized Transformers.

Jupyter Notebook 32 4 Updated Apr 24, 2024

unimorph / umLabeller

Inspection tool for characterizing the semantic compositionality of subword tokenization in English

Python 3 Updated Apr 23, 2024

ScandEval / ScandEval

Evaluation of language models on mono- or multilingual tasks.

Python 60 11 Updated May 5, 2024

malteos / turkish-lm-bias

Investigating Gender Bias in Turkish Language Models

Jupyter Notebook 1 Updated Apr 30, 2024

EleutherAI / improved-t5

Experiments for efforts to train a new and improved t5

Python 75 5 Updated Apr 15, 2024

Photooon / Multi-Level-Training-Framework

Official implementation of "A Multi-level Framework for Accelerating Training Transformer Models"

Python 4 Updated Apr 15, 2024

google-deepmind / recurrentgemma

Open weights language model from Google DeepMind, based on Griffin.

Python 513 19 Updated Apr 14, 2024

McGill-NLP / llm2vec

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 442 30 Updated May 3, 2024

lm-pub-quiz / BEAR

BEAR dataset

5 Updated Apr 8, 2024

impresso / newsagency-classification

Jupyter Notebook 1 Updated Apr 5, 2024

helpmefindaname / transformer-smaller-training-vocab

Temporarily remove unused tokens during training to save RAM and speed up training.

Python 20 2 Updated Apr 5, 2024

unum-cloud / uform

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

Python 892 51 Updated May 1, 2024

erionc / albnlp

20 1 Updated Mar 26, 2024

MaartenGr / KeyBERT

Minimal keyword extraction with BERT

Python 3,229 327 Updated Mar 21, 2024

dobbersc / fundus-evaluation

Evaluation of the Fundus News Scraper https://github.com/flairNLP/fundus

Python 6 1 Updated Apr 2, 2024

hetzneronline / community-content

Hetzner Online Community Project

Markdown 264 326 Updated May 3, 2024

DataScienceUIBK / ChroniclingAmericaQA

ChroniclingAmericaQA: A Large-scale Question Answering Dataset based on Historical American Newspaper Pages

4 1 Updated Feb 10, 2024

VikParuchuri / surya

OCR, layout analysis, reading order, line detection in 90+ languages

Python 6,370 386 Updated May 5, 2024

Akeepers / LEAR

The implementation of our EMNLP 2021 paper "Enhanced Language Representation with Label Knowledge for Span Extraction".

Python 111 13 Updated May 22, 2023

mainlp / maibaam-code

Code for preprocessing data for UD annotations and for tagging/parsing experiments of MaiBaam

Python 1 Updated Mar 13, 2024

trusthlt / eacl24-german-legal-questions

Data and code: "Answering legal questions from laymen in German civil law system", Büttner & Habernal, EACL'24

Python 6 2 Updated Mar 2, 2024

xai-org / grok-1

Grok open release

Python 48,186 8,165 Updated May 2, 2024

mlfoundations / scaling

Language models scale reliably with over-training and on downstream tasks

Jupyter Notebook 81 3 Updated Apr 2, 2024

gregorbachmann / Next-Token-Failures

Python 49 3 Updated Mar 12, 2024

UniversalDependencies / UD_Bavarian-MaiBaam

1 Updated May 5, 2024

lukasvoege / ZeroShot-step-by-step-distillation

Master's thesis project @HU-Berlin

Jupyter Notebook 2 1 Updated Dec 21, 2023

urchade / GLiNER

Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 24

Python 633 48 Updated Apr 25, 2024

gautierdag / tokenizer-bench

Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"

Python 9 1 Updated Feb 14, 2024

Oxen-AI / oxen-release

Lightning fast data version control system for structured and unstructured machine learning datasets. We aim to make versioning datasets as easy as versioning code.

Python 836 12 Updated Apr 1, 2024
