Paramveer singh

Paramveer singh

@Paramveersingh-S
12
Followers
92
Following
87
Public Repos
0
Private Repos

Language Breakdown

Lines of code distribution across 53 owned repositories

4.6M Total LOC
TypeScript
1,392,854 lines
30.0%
N/A
Jupyter Notebook
1,370,579 lines
29.5%
N/A
Python
1,023,176 lines
22.0%
N/A
JavaScript
294,096 lines
6.3%
N/A
C
197,940 lines
4.3%
N/A
Other
368,879 lines
7.9%
N/A
M

M-Shaped Developer

M-shaped

Multi-specialist across TypeScript, Jupyter Notebook, Python

TypeScript
Jupyter Notebook
Python
JavaScript
C

Collaboration Network

Global Impact visualization

LIVE
Paramveer singh
0 active collaborators

Repos

91

PRs

0

Growth

+18%

Top Collaborators

No collaborator data yet.

Coding Streak

Contribution activity over the past year

3 days
446
Contributions
368
Commits
3
Pull Requests
Jun Jul Aug Sep Oct Nov Dec Jan Feb Mar Apr May Jun
Mo
We
Fr
Based on GitHub activity
Less
More

Top Repositories

fluffy-harness

Welcome to the Pi Agent Improvement Suite! This repository provides enterprise-grade extensions, integrations, and core patches for the earendil-works/pi coding agent. It drastically enhances Pi's security, operational reliability, and ecosystem connectivity.

1 0
TypeScript
pi

AI agent toolkit: unified LLM API, agent loop, TUI, coding agent CLI

1 0
TypeScript
Dynamic-Fraud-Detection-System

The Dynamic Real-Time Fraud Detection System is an end-to-end, production-ready machine learning platform designed to detect and block fraudulent financial transactions in under 50 milliseconds.

1 0
Python
Semantic-search-RAG

A production-grade, distributed microservices platform for Enterprise Semantic Search and Retrieval-Augmented Generation (RAG). Capable of ingesting millions of documents, indexing them via Hybrid Search (BM25 + Dense Vectors + custom SPLADE v2), and serving sub-100ms LLM-powered answer generation.

1 0
Python
Lightweight-LLM-Inference-Engine

LiteLlama is a high-performance, from-scratch Large Language Model inference engine tailored for the LLaMA-3 model family.

1 0
Python
Sparse-Attention

Sparse-Attention provides custom CUDA/Triton kernels that replace the standard O ( N 2 ) dense attention mechanism with a structured block-sparse pattern. By combining heterogeneous attention constraints (local sliding windows, global stride landmarks, and prefix context), it enables Large Language Models to scale context lengths up to 1M tokens

1 0
Python
Serverless-pastebin

A production-grade Serverless Pastebin infrastructure deployed on AWS using Terraform. Designed to be fully scalable, highly observable, and strictly secure while leveraging the AWS Free Tier.

1 0
HCL
triton

Development repository for the Triton language and compiler

1 0
MLIR
Triton-MLOps

The Triton-Based Fused Operator Suite targets the most critical bottlenecks in Large Language Model (LLM) inference pipelines. Written purely in OpenAI Triton, these kernels bypass PyTorch's ATen overhead by aggressively fusing operations, maintaining data in highly performant SRAM, and dramatically reducing Global Memory (GMEM) round-trips.

1 0
Python
port-1
1 0
TypeScript

Open Source Impact

Contributions to external projects

1 merged PRs
Contributed to 2 repositories