203 Repositories
C++ (or C) performance Libraries
SHARK - High Performance Machine Learning for CPUs, GPUs, Accelerators and Heterogeneous Clusters
SHARK Communication Channels GitHub issues: Feature requests, bugs etc Nod.ai SHARK Discord server: Real time discussions with the nod.ai team and oth
PaRSEC: the Parallel Runtime Scheduler and Execution Controller for micro-tasks on distributed heterogeneous systems.
PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core heterogeneous architectures. PaRSEC assigns computation threads to the cores, GPU accelerators, overlaps communications and computations and uses a dynamic, fully-distributed scheduler based on architectural features such as NUMA nodes and algorithmic features such as data reuse.
A modern thread pool implementation based on C++20
thread-pool A simple, functional thread pool implementation using pure C++20. Features Built entirely with C++20 Enqueue tasks with or without trackin
Browser and NodeJS Web Assembly audio decoder libraries that are highly optimized for size and performance.
WASM Audio Decoders WASM Audio Decoders is a collection of Web Assembly audio decoder libraries that are highly optimized for browser use. Each module
A high performance fiber RPC network framework. 高性能协程RPC网络框架
ACID: 高性能协程RPC框架 学习本项目需要有一定的C++,网络,RPC知识 项目依赖 1.项目用到了大量C++17/20新特性,如constexpr if的编译期代码生成,基于c++20 coroutine的无栈协程状态机解析 URI 和 HTTP 协议等。注意,必须安装g++-11,否则不支
The dgSPARSE Library (Deep Graph Sparse Library) is a high performance library for sparse kernel acceleration on GPUs based on CUDA.
dgSPARSE Library Introdution The dgSPARSE Library (Deep Graph Sparse Library) is a high performance library for sparse kernel acceleration on GPUs bas
Fast UI Draw is a library that provides a higher performance Canvas interface.
Fast UI Draw is a library that provides a higher performance Canvas interface. It is designed so that it always draws using a GPU.
A portable fork of the high-performance regular expression matching library
Vectorscan? A fork of Intel's Hyperscan, modified to run on more platforms. Currently ARM NEON/ASIMD is 100% functional, and Power VSX are in developm
Lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance
OpenCL-Wrapper OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are very cumbersome and the code overhead pr
MozJPEG improves JPEG compression efficiency achieving higher visual quality and smaller file sizes at the same time
Mozilla JPEG Encoder Project MozJPEG improves JPEG compression efficiency achieving higher visual quality and smaller file sizes at the same time. It
High performance physically based renderer in C++11
The Tungsten Renderer About Tungsten is a physically based renderer originally written for the yearly renderer competition at ETH. It simulates full l
Oboe is a C++ library that makes it easy to build high-performance audio apps on Android.
Oboe Oboe is a C++ library which makes it easy to build high-performance audio apps on Android. It was created primarily to allow developers to target
Kit: a magical, high performance programming language, designed for game development
Kit: a magical, high performance programming language, designed for game development
Thoughts about entity-component-system
About Warning: This is not a complete production-ready library for entity-component-system. This is only my thoughts about how the modern entity-compo
A high-performance and easy-to-use C++ network library.
pine A high-performance and easy-to-use C++ network library. Now this is just a toy library for education purpose, do not use in production. example A
High performance build system for Windows, OSX and Linux. Supporting caching, network distribution and more.
FASTBuild FASTBuild is a build system for Windows, OSX and Linux, supporting distributed compilation and object caching. It is used by many game devel
An eventing framework for building high performance and high scalability systems in C.
NOTE: THIS PROJECT HAS BEEN DEPRECATED AND IS NO LONGER ACTIVELY MAINTAINED As of 2019-03-08, this project will no longer be maintained and will be ar
A high performance, middleware oriented C++14 http web framework please use matt-42/lithium instead
A high performance, middleware oriented C++14 http web framework please use matt-42/lithium instead
Full-featured high-performance event loop loosely modelled after libevent
libev is a high-performance event loop/event model with lots of features. (see benchmark at http://libev.schmorp.de/bench.html) ABOUT Homepage:
Lightweight profiler library for c++
easy_profiler About Key features Usage Integration General build system CMake Inserting blocks Storing variables Collect profiling data Streaming over
A high-performance MongoDB driver for C
mongo-c-driver About mongo-c-driver is a project that includes two libraries: libmongoc, a client library written in C for MongoDB. libbson, a library
libmdbx is an extremely fast, compact, powerful, embedded, transactional key-value database, with permissive license
One of the fastest embeddable key-value ACID database without WAL. libmdbx surpasses the legendary LMDB in terms of reliability, features and performance.
High performance library for creating, modiyfing and parsing PDF files in C++
Welcome to PDF-Writer. A Fast and Free C++ Library for Creating, Parsing an Manipulating PDF Files and Streams. Documentation is available here. Proje
SeqAn's official repository.
ATTENTION: SeqAn3 is out and hosted in a different repository: https://github.com/seqan/seqan3 All new applications should be based on SeqAn3 and all
CppCMS - High Performance C++ Web Framework
CppCMS - High Performance C++ Web Framework What is CppCMS? CppCMS is a Free High Performance Web Development Framework (not a CMS) aimed at Rapid Web
A coupling library for partitioned multi-physics simulations, including, but not restricted to fluid-structure interaction and conjugate heat transfer simulations.
A coupling library for partitioned multi-physics simulations, including, but not restricted to fluid-structure interaction and conjugate heat transfer simulations.
🖼️ A writeable in-memory Image JSI Host Object
🖼️ react-native-jsi-image 🏗️ This library is work in progress! 🏗️ A writeable in-memory Image JSI Host Object. JSI-Image is a modern library that p
A high performance, shared memory, lock free, cross platform, single file, no dependencies, C++11 key-value store
SimDB A high performance, shared memory, lock free, cross platform, single file, no dependencies, C++11 key-value store. SimDB is part of LAVA (Live A
Visualization Library is a C++ middleware for high-performance 2D and 3D graphics applications based on OpenGL 1.x-4.x supporting Windows, Linux and Mac OS X.
Visualization Library 2.2 Gallery About Visualization Library is a C++ middleware for high-performance 2D and 3D graphics applications based on the in
ObjectBox C and C++: super-fast database for objects and structs
ObjectBox Embedded Database for C and C++ ObjectBox is a superfast C and C++ database for embedded devices (mobile and IoT), desktop and server apps.
TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.
TensorRT Open Source Software This repository contains the Open Source Software (OSS) components of NVIDIA TensorRT. Included are the sources for Tens
Thread-pool-cpp - High performance C++11 thread pool
thread-pool-cpp It is highly scalable and fast. It is header only. No external dependencies, only standard library needed. It implements both work-ste
ThreadPool - A fastest, exception-safety and pure C++17 thread pool.
Warnings Since commit 468129863ec65c0b4ede02e8581bea682351a6d2, I move ThreadPool to C++17. (To use std::apply.) In addition, the rule of passing para
logog is a portable C++ library to facilitate logging of real-time events in performance-oriented applications
logog is a portable C++ library to facilitate logging of real-time events in performance-oriented applications, such as games. It is especially appropriate for projects that have constrained memory and constrained CPU requirements.
Fft-benchmark - A benchmark for comparison of FFT algorithms performance
FFT benchmark A benchmark for comparison of FFT algorithms performance. Currently supports Intel IPP, KFR, FFTW and KissFFT. Requires: Clang 6.0+ (GCC
Perf-ninja - This is an online course where you can learn and master the skill of low-level performance analysis and tuning.
Performance Ninja Class This is an online course where you can learn to find and fix low-level performance issues, for example CPU cache misses and br
Parca-agent - eBPF based always-on profiler auto-discovering targets in Kubernetes and systemd, zero code changes or restarts needed!
Parca Agent Parca Agent is an always-on sampling profiler that uses eBPF to capture raw profiling data with very low overhead. It observes user-space
Adorad - Fast, Expressive, & High-Performance Programming Language for those who dare
The Adorad Language Adorad | Documentation | Contributing | Compiler design Key Features of Adorad Simplicity: the language can be learned in less tha
Sagan - a multi-threads, high performance log analysis engine
Sagan - Sagan is a multi-threads, high performance log analysis engine. At it's core, Sagan similar to Suricata/Snort but with logs rather than network packets.
High-performance time-series aggregation for PostgreSQL
PipelineDB has joined Confluent, read the blog post here. PipelineDB will not have new releases beyond 1.0.0, although critical bugs will still be fix
Beringei is a high performance, in-memory storage engine for time series data.
** THIS REPO HAS BEEN ARCHIVED AND IS NO LONGER BEING ACTIVELY MAINTAINED ** Beringei A high performance, in memory time series storage engine In the
C++ high-performance gym environment framework
gym_cpp author: yeting email : [email protected] C++ high-performance gym environment framework Dependence apt install python3 python3-dev cmake pip3
Smartstreaming is a high-performance and scalable streaming media server.
1. introduction Smartstreaming is a high-performance and scalable streaming media server. 2. design | io | Coroutine | | transport | tcp/udp/srt/quic
PGSpider: High-Performance SQL Cluster Engine for distributed big data.
PGSpider: High-Performance SQL Cluster Engine for distributed big data.
BLAS-like Library Instantiation Software Framework
Contents Introduction Education and Learning What's New What People Are Saying About BLIS Key Features How to Download BLIS Getting Started Example Co
A program developed using MPI for distributed computation of Histogram for large data and their performance anaysis on multi-core systems
mpi-histo A program developed using MPI for distributed computation of Histogram for large data and their performance anaysis on multi-core systems. T
Emergency alert and tracer for realtime high-performance computing app (work in progress, currently supported env is only Linux x86-64).
HPC Emerg Emergency alert and tracer for realtime high-performance computing app (work in progress, currently supported env is only Linux x86-64). Exa
Cloud-native high-performance edge/middle/service proxy
Cloud-native high-performance edge/middle/service proxy Envoy is hosted by the Cloud Native Computing Foundation (CNCF). If you are a company that wan
ncnn is a high-performance neural network inference framework optimized for the mobile platform
ncnn ncnn is a high-performance neural network inference computing framework optimized for mobile platforms. ncnn is deeply considerate about deployme
Trilogy is a client library for MySQL-compatible database servers, designed for performance, flexibility, and ease of embedding.
Trilogy is a client library for MySQL-compatible database servers, designed for performance, flexibility, and ease of embedding.
Real Time, High performance BOT detection and protection
REAL-TIME BOT PROTECTION CHALLENGE IronFox https://innovera.ir IronFox is a real-time and high performance bot protection, that using Nginx as a reve
Airspy HF+, a high performance software defined radio for the HF and VHF bands.
User mode driver for Airspy HF+ This repository contains host software (Linux/Windows) for Airspy HF+, a high performance software defined radio for t
A stable nginx module for SSL/TLS ja3 fingerprint, with high performance.
nginx-ssl-fingerprint A stable nginx module for SSL/TLS ja3 fingerprint, with high performance. Description This module adds new nginx variables for t
Visual-inertial-wheel fusion odometry, better performance in scenes with drastic changes in light
VIW-Fusion An visual-inertial-wheel fusion odometry VIW-Fusion is an optimization-based viusla-inertial-wheel fusion odometry, which is developed as a
A C++ Class and Template Library for Performance Critical Applications
Spirick Tuning A C++ Class and Template Library for Performance Critical Applications Optimized for Performance The Spirick Tuning library provides a
Flexible, portable, high-performance bit fields C++ library. unsigned a:13 becomes F13 a;
C-plus-plus-library-bit-fields Flexible, portible, high-performance bit fields C++ library. The bit fields are specified with a dummy structure where
Blazing-fast Expression Templates Library (ETL) with GPU support, in C++
Expression Templates Library (ETL) 1.3.0 ETL is a header only library for C++ that provides vector and matrix classes with support for Expression Temp
Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.
The Diff Match and Patch libraries offer robust algorithms to perform the operations required for synchronizing plain text. Diff: Compare two blocks o
A low-latency LRU approximation cache in C++ using CLOCK second-chance algorithm. Multi level cache too. Up to 2.5 billion lookups per second.
LruClockCache Low-latency LRU approximation cache in C++ using CLOCK second-chance algorithm. (see wiki for details) using MyKeyType = std::string; us
Gigantua: Single Threaded - 2 Billion Moves/s Chess position enumerator
Description for the code: https://www.codeproject.com/Articles/5313417/Worlds-fastest-Bitboard-Chess-Movegenerator Gigantua Chess PerfT Command line o
A repository for I/O ring demos, use cases and performance testing on Windows
IoRing_Demos A repository for I/O ring demos, use cases and performance testing on Windows Overview of I/O ring on Windows: https://windows-internals.
Experimental and Comparative Performance Measurements of High Performance Computing Based on OpenMP and MPI
High-Performance-Computing-Experiments Experimental and Comparative Performance Measurements of High Performance Computing Based on OpenMP and MPI 实验结
A cross-platform network learning demos. Like high-performance http server
Network-Learn A cross-platform network learning demos (toys). And I try not to use 3rd-party libraries. Welcome to try it out and leave your comments.
pgagroal is a high-performance protocol-native connection pool for PostgreSQL.
pgagroal is a high-performance protocol-native connection pool for PostgreSQL.
PHP Encoder, protect PHP scripts in PHP 8 and PHP 7, High Performance, Compitable with X86_64, MIPS, ARM platform and Ubuntu/Centos/OpenWRT system.
What's FRICC2? FRICC2 is a PHP Script encryption tool. When you are developing a commercial software using PHP, the script can be distributed as encry
SIMULATeQCD is a multi-GPU Lattice QCD framework that makes it simple and easy for physicists to implement lattice QCD formulas while still providing the best possible performance.
SIMULATeQCD a SImple MUlti-GPU LATtice code for QCD calculations SIMULATeQCD is a multi-GPU Lattice QCD framework that makes it simple and easy for ph
Instagram's experimental performance oriented greenfield implementation of Python.
Welcome to Skybison! Skybison is experimental performance-oriented greenfield implementation of Python 3.8. It contains a number of performance optimi
Convenient, high-performance RGB color and position control for console output
Oof (omnipotent output friend) It's common for C++ programs to write output to the console. But consoles are far more capable than what they are usual
High Performance Linux C++ Network Programming Framework based on IO Multiplexing and Thread Pool
Kingpin is a C++ network programming framework based on TCP/IP + epoll + pthread, aims to implement a library for the high concurrent servers and clie
CPU Performance Evaluation and Execution Time Prediction Using Narrow Spectrum Benchmarking
This is a simple implementation of Saavedra-Barrera's paper SAAVEDRA-BARRERA R H. CPU Performance Evaluation and Execution Time Prediction Using Narrow Spectrum Benchmarking[D/OL]. UCB/CSD92-684. EECS Department, University of California, Berkeley, 1992.
Fast C++ IPC using shared memory
Fast C++ IPC using shared memory
High Performance 3D Game Engine, with a high emphasis on Rendering
Electro High Performance 3D Game Engine, with a high emphasis on Rendering MainFeatures Rendering PBR Renderer (Cook–Torrance GGX) IBL (Image Based Li
CPS162 Fall 2021 Group1: Gabriela Fernandez Leon, Tan Dinh, Alan Baxley
Big-O-Notation Table of contents What is Big O Notation Asymptotic Growth and the Formal Mathematical Definition of Big O Types of Time Complexity Ana
C++-based high-performance parallel environment execution engine for general RL environments.
EnvPool is a highly parallel reinforcement learning environment execution engine which significantly outperforms existing environment executors. With
⚡High-performance rendering for python
ZenGL ZenGL is a minimalist Python module providing exactly one way to render scenes with OpenGL. pip install zengl Documentation zengl on Github zen
PendulumSynth is an on-going and open-source project, running on Arduino platform with the goal of mixing real-world physics into music composition and musical performance.
PendulumSynth PendulumSynth is an on-going and open-source project, running on Arduino platform with the goal of mixing real-world physics into music
A small program that acts as a C profiler.
Simple Performance Test Description This is a simple program in C for performance testing which only works for x86 and x86_64 systems. A for loop is e
Data Structures and Algorithms course (IIC2133-PUC)
Data-Structures-And-Algorithms This are the homeworks of Jose Antonio Castro from Data Structures and Algorithms course (IIC2133) in Pontificia Univer
bsnes is a Super Nintendo (SNES) emulator focused on performance, features, and ease of use.
bsnes is a Super Nintendo (SNES) emulator focused on performance, features, and ease of use.
U++ is a C++ cross-platform rapid application development framework focused on programmer's productivity. It includes a set of libraries (GUI, SQL, Network etc.), and integrated development environment (TheIDE).
Ultimate++ Ultimate++ is a C++ cross-platform rapid application development framework focused on programmers productivity. It includes a set of librar
WineFS is a file system for Persistent Memory (PM) which is aimed at maximizing the performance of memory-mapped applications.
WineFS WineFS is a file system for Persistent Memory (PM) which is aimed at maximizing the performance of memory-mapped applications. WineFS uses a no
This algorithm is amazing and take a high performance to search something under array.
Sequential Binary Algorithm O(n) Algoritmo Este é um algoritmo de complexidade O(log n), que possui uma alta performance em percorrer um vetor de inte
Visual odometry package based on hardware-accelerated NVIDIA Elbrus library with world class quality and performance.
Isaac ROS Visual Odometry This repository provides a ROS2 package that estimates stereo visual inertial odometry using the Isaac Elbrus GPU-accelerate
Cista is a simple, high-performance, zero-copy C++ serialization & reflection library.
Simple C++ Serialization & Reflection. Cista++ is a simple, open source (MIT license) C++17 compatible way of (de-)serializing C++ data structures. Si
High-performance specialized replacements for PHP's pack() and unpack() functions
ext-encoding High-performance specialized replacements for PHP's pack() and unpack() functions Under a profiler, it becomes obvious that PHP's pack()
C++ Matrix -- High performance and accurate (e.g. edge cases) matrix math library with expression template arithmetic operators
Matrix This is a math and arithmetic matrix library. It has stood many years of performing in mission critical production for financial systems. It ha
ArrayFire: a general purpose GPU library.
ArrayFire is a general-purpose tensor library that simplifies the process of software development for the parallel architectures found in CPUs, GPUs,
An Out-of-the-Box TensorRT-based Framework for High Performance Inference with C++/Python Support
An Out-of-the-Box TensorRT-based Framework for High Performance Inference with C++/Python Support
A high performance C++14 library for effortlessly reading and writing UBJSON
UbjsonCpp A high performance C++14 library for effortlessly reading and writing UBJSON This library implements UBJSON Draft 12 and Value semmantics Ab
High performance server-side application framework
Seastar Introduction SeaStar is an event-driven framework allowing you to write non-blocking, asynchronous code in a relatively straightforward manner
A headers only high performance C++ middleware framework/lib. See README for details.
# README # hmbdc is an open source headers only C++ framework and library built on top of various real-world tested lockfree algorithms that facilit
A memory allocation program, it is used for doing an experiment to find out the detail of Microsoft Windows taskmgr performance information
memory-allocation-test A memory allocation program, it is used for doing an experiment to find out the detail of Microsoft Windows taskmgr performance
A C++17 message passing library based on MPI
MPL - A message passing library MPL is a message passing library written in C++17 based on the Message Passing Interface (MPI) standard. Since the C++
Multi-backend implementation of SYCL for CPUs and GPUs
hipSYCL - a SYCL implementation for CPUs and GPUs hipSYCL is a modern SYCL implementation targeting CPUs and GPUs, with a focus on leveraging existing
A C++ GPU Computing Library for OpenCL
Boost.Compute Boost.Compute is a GPU/parallel-computing library for C++ based on OpenCL. The core library is a thin C++ wrapper over the OpenCL API an
High-performance Fortran program to calculate polarizability and inverse dielectric response function.
DielectricKit First-principles HPC toolkit for simulating dielectric responses Introduction DielectricKit is a high-performance computing toolkit to c
An SQLite binding for node.js with built-in encryption, focused on simplicity and (async) performance
Description An SQLite (more accurately SQLite3MultipleCiphers) binding for node.js focused on simplicity and (async) performance. When dealing with en
A C++ data container replicating std::stack functionality but with better performance than standard library containers in a stack context.
plf::stack A data container replicating std::stack functionality but with better performance than standard library containers in a stack context. C++9
An unordered C++ data container providing fast iteration/insertion/erasure while maintaining pointer/iterator validity to non-erased elements regardless of insertions/erasures. Provides higher-performance than std:: library containers for high-modification scenarios with unordered data.
plf::colony An unordered data container providing fast iteration/insertion/erasure while maintaining pointer/iterator/reference validity to non-erased