stefan-it

🤓

hacking 🎧

Stefan Schweter stefan-it

🤓

hacking 🎧

Researcher, M.Sc Computational Linguistics, Former student @ The Center for Information and Language Processing (CIS), LMU Munich

494 followers · 136 following

Near Munich, Germany
23:52 (UTC +02:00)
https://schweter.ml

Achievements

x3 x2 x3

BetaSend feedback

Achievements

x3 x2 x3

BetaSend feedback

Organizations

Block or Report

Block or report stefan-it

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Stars

cisnlp / TransMI

TransMI: A Framework to Create Strong Baselines from Multilingual Pretrained Language Models for Transliterated Data

Python 2 Updated May 17, 2024

bltlab / paranames

ParaNames: A multilingual resource for parallel names

Python 23 3 Updated May 9, 2024

pytorch / torchtitan

A native PyTorch Library for large model training

Python 1,151 105 Updated May 18, 2024

cisnlp / XAMPLER

XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples

Python 3 Updated May 9, 2024

google-research / causallm_icl

Python 7 Updated Jan 17, 2024

bminixhofer / zett

Code for Zero-Shot Tokenizer Transfer

Python 64 1 Updated May 14, 2024

occiglot / tech-report

Raw data, scripts, etc. to produce the tables and figures of our technical report

5 1 Updated May 6, 2024

kjslag / spacebyte

A byte-level decoder architecture that matches the performance of tokenized Transformers.

Jupyter Notebook 36 4 Updated Apr 24, 2024

unimorph / umLabeller

Inspection tool for characterizing the semantic compositionality of subword tokenization in English

Python 3 Updated Apr 23, 2024

ScandEval / ScandEval

Evaluation of language models on mono- or multilingual tasks.

Python 64 12 Updated May 17, 2024

malteos / turkish-lm-bias

Investigating Gender Bias in Turkish Language Models

Jupyter Notebook 1 Updated Apr 30, 2024

EleutherAI / improved-t5

Experiments for efforts to train a new and improved t5

Python 75 5 Updated Apr 15, 2024

Photooon / Multi-Level-Training-Framework

Official implementation of "A Multi-level Framework for Accelerating Training Transformer Models""

Python 5 Updated Apr 15, 2024

google-deepmind / recurrentgemma

Open weights language model from Google DeepMind, based on Griffin.

Python 528 19 Updated Apr 14, 2024

McGill-NLP / llm2vec

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 674 41 Updated May 16, 2024

lm-pub-quiz / BEAR

BEAR dataset

6 Updated Apr 8, 2024

impresso / newsagency-classification

Recognition of news agency mentions in historical news articles (BERT-based token classification).

Jupyter Notebook 1 Updated May 6, 2024

helpmefindaname / transformer-smaller-training-vocab

Temporary remove unused tokens during training to save ram and speed.

Python 20 2 Updated Apr 5, 2024

unum-cloud / uform

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

Python 908 52 Updated May 1, 2024

erionc / albnlp

20 1 Updated Mar 26, 2024

MaartenGr / KeyBERT

Minimal keyword extraction with BERT

Python 3,246 328 Updated Mar 21, 2024

dobbersc / fundus-evaluation

Evaluation of the Fundus News Scraper https://github.com/flairNLP/fundus

Python 6 1 Updated Apr 2, 2024

hetzneronline / community-content

Hetzner Online Community Project

Markdown 264 325 Updated May 7, 2024

DataScienceUIBK / ChroniclingAmericaQA

ChroniclingAmericaQA: A Large-scale Question Answering Dataset based on Historical American Newspaper Pages

4 1 Updated Feb 10, 2024

VikParuchuri / surya

OCR, layout analysis, reading order, line detection in 90+ languages

Python 6,875 419 Updated May 18, 2024

Akeepers / LEAR

The implementation our EMNLP 2021 paper "Enhanced Language Representation with Label Knowledge for Span Extraction".

Python 111 13 Updated May 22, 2023

mainlp / maibaam-code

Code for preprocessing data for UD annotations and for tagging/parsing experiments of MaiBaam

Python 1 Updated Mar 13, 2024

trusthlt / eacl24-german-legal-questions

Data and code: "Answering legal questions from laymen in German civil law system", Büttner & Habernal, EACL'24

Python 6 2 Updated Mar 2, 2024

xai-org / grok-1

Grok open release

Python 48,487 8,217 Updated May 2, 2024

mlfoundations / scaling

Language models scale reliably with over-training and on downstream tasks

Jupyter Notebook 82 4 Updated Apr 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stefan Schweter stefan-it

Achievements

Achievements

Organizations

Block or report stefan-it

Stars

cisnlp / TransMI

bltlab / paranames

pytorch / torchtitan

cisnlp / XAMPLER

google-research / causallm_icl

bminixhofer / zett

occiglot / tech-report

kjslag / spacebyte

unimorph / umLabeller

ScandEval / ScandEval

malteos / turkish-lm-bias

EleutherAI / improved-t5

Photooon / Multi-Level-Training-Framework

google-deepmind / recurrentgemma

McGill-NLP / llm2vec

lm-pub-quiz / BEAR

impresso / newsagency-classification

helpmefindaname / transformer-smaller-training-vocab

unum-cloud / uform

erionc / albnlp

MaartenGr / KeyBERT

dobbersc / fundus-evaluation

hetzneronline / community-content

DataScienceUIBK / ChroniclingAmericaQA

VikParuchuri / surya

Akeepers / LEAR

mainlp / maibaam-code

trusthlt / eacl24-german-legal-questions

xai-org / grok-1

mlfoundations / scaling