renorm-native

🚀 Overview

renorm-native is a PyTorch-compatible neural network module designed to improve numerical stability in deep learning models.

It provides transformer-ready layers that are robust to:

- Training instability (NaNs / exploding gradients)
- Irregular tensor shapes and sequence lengths
- Mixed CPU/GPU execution environments
- Memory pressure in large-scale workloads

It is designed to be a drop-in architectural component for modern deep learning pipelines.

📦 Installation

Install from PyPI:

pip install renorm-native

Upgrade to latest version:

pip install --upgrade renorm-native

⚡ Quick Start (30 seconds)

Transformer Layer Example

import torch
from renorm import RenormTransformerLayer

# Initialize layer
layer = RenormTransformerLayer(dim=512, heads=8)

# Dummy input: (batch, sequence, features)
x = torch.randn(2, 16, 512)

# Forward pass
y = layer(x)

print(y.shape)

Expected Output

torch.Size([2, 16, 512])

🧠 Core API

1. RenormTransformerLayer

A lightweight transformer block with built-in normalization stability.

RenormTransformerLayer(
    dim: int,
    heads: int,
    eps: float = 1e-5
)

Parameters:

- dim: hidden dimension size
- heads: number of attention heads
- eps: numerical stability constant (default 1e-5)
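
To illustrate why a constant like eps is useful, the sketch below normalizes a vector by its root mean square in plain Python. It is a generic RMS normalization written for illustration only: the source does not state which formula RenormTransformerLayer uses internally, and `rms_normalize` is a hypothetical helper, not part of the package.

```python
import math

def rms_normalize(values, eps=1e-5):
    """Scale values to roughly unit root-mean-square (hypothetical helper)."""
    mean_square = sum(v * v for v in values) / len(values)
    # eps keeps the denominator strictly positive, even for an all-zero input.
    scale = math.sqrt(mean_square + eps)
    return [v / scale for v in values]

# Regular input: the result has an RMS close to 1.
print(rms_normalize([3.0, 4.0]))

# All-zero input: returns zeros instead of dividing by zero.
print(rms_normalize([0.0, 0.0, 0.0]))
```

With eps set to 0, the second call would raise ZeroDivisionError, which is the kind of failure a stability constant is meant to prevent.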

2. RenormLinear

A stable replacement for torch.nn.Linear.

Example:

import torch
from renorm.layers import RenormLinear

# 256 input features, 128 output features (same argument order as torch.nn.Linear)
layer = RenormLinear(256, 128)
y = layer(torch.randn(4, 256))

⚙️ Device Compatibility

The layers work across:

- CPU (Windows / Linux / Mac)
- CUDA (NVIDIA GPUs)
- Mixed environments (fallback-safe execution)

Example:

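import torch
from renorm import RenormTransformerLayer

# Use the GPU when one is available, otherwise fall back to the CPU
device = "cuda" if torch.cuda.is_available() else "cpu"

layer = RenormTransformerLayer(dim=512, heads=8).to(device)
x = torch.randn(2, 16, 512, device=device)  # create the input on the target device

y = layer(x)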

🧪 Minimal Validation Test

Run this to verify installation:

python -c "from renorm import RenormTransformerLayer; print(RenormTransformerLayer(dim=256, heads=4))"

Expected behavior: the command exits without errors and prints the layer's module representation.

🏗 Architecture Summary

renorm-native uses a dual-path execution design:

CUDA Path (GPU):
- Optimized tensor execution path
- High-performance kernel routing (where available)
CPU Path (Fallback):
- Stable numerical execution engine
- Strict variance preservation for stability

This ensures consistent behavior across heterogeneous compute environments.

📊 Stability Design Principles

1. Variance Stabilization

Prevents numerical collapse in deep stacks by maintaining bounded activation scaling.
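
The idea of bounded activation scaling can be shown with a toy stack in plain Python. In this sketch each "layer" simply multiplies its input by a fixed gain, and the optional renormalization step is a generic RMS rescaling. Both are assumptions chosen for illustration; the source does not describe the package's actual mechanism, and `run_stack` is a hypothetical helper.

```python
import math

def rms(values):
    """Root mean square of a list of floats."""
    return math.sqrt(sum(v * v for v in values) / len(values))

def run_stack(values, depth, gain, renormalize, eps=1e-5):
    """Apply `depth` toy layers that each multiply activations by `gain`."""
    for _ in range(depth):
        values = [gain * v for v in values]
        if renormalize:
            # Rescale to roughly unit RMS so the magnitude cannot compound.
            scale = math.sqrt(sum(v * v for v in values) / len(values) + eps)
            values = [v / scale for v in values]
    return values

x = [0.5, -1.0, 2.0, -0.25]
plain = run_stack(x, depth=100, gain=1.5, renormalize=False)
stable = run_stack(x, depth=100, gain=1.5, renormalize=True)

print(rms(plain))   # grows in proportion to 1.5 ** 100
print(rms(stable))  # stays close to 1
```

Without the rescaling step the magnitude compounds with depth; with it, the output scale is independent of how many layers were stacked.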

2. Memory Safety

Ensures gradient computation remains isolated from unsafe tensor views in dynamic graphs.

3. Execution Portability

Same model behavior across CPU and GPU environments.

📌 Example Use Cases

- Transformer models (LLMs)
- Time-series forecasting systems
- Anomaly detection pipelines
- Edge-device inference systems
- Low-memory GPU environments

⚠️ Notes

- Requires PyTorch ≥ 2.0
- Python ≥ 3.10 recommended
- CUDA optional but supported

📄 License

MIT License — see LICENSE for details.

🤝 Contributing

Contributions, issues, and improvements are welcome.

🔗 Project

Maintained by the renorm-native team.

🏢 Enterprise / Production Usage

renorm-native can be used in production systems requiring deterministic numerical stability under high load.

Typical deployment environments:

- GPU inference clusters (CUDA-enabled)
- On-prem ML pipelines
- Edge inference systems
- Distributed training environments (PyTorch DDP)

🔐 Enterprise License Mode (Optional)

Some builds may enable enterprise validation for regulated or production deployments.

Environment Variable

export RENORM_ENTERPRISE_KEY="your_token_here"

Format

base64_payload.hex_hmac_signature
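
The format line suggests a Base64 payload and a hex HMAC signature joined by a dot. The sketch below shows how a token of that shape could be built and checked with the Python standard library. Everything beyond the two-part layout is an assumption: the JSON payload, the HMAC-SHA256 algorithm, the shared secret, and the function names `make_token` and `verify_token` are hypothetical and are not taken from renorm.auth.

```python
import base64
import hashlib
import hmac
import json

def make_token(payload: dict, secret: bytes) -> str:
    """Build '<base64 payload>.<hex HMAC>' (assumed layout and algorithm)."""
    raw = json.dumps(payload, sort_keys=True).encode("utf-8")
    encoded = base64.b64encode(raw).decode("ascii")
    signature = hmac.new(secret, encoded.encode("utf-8"), hashlib.sha256).hexdigest()
    return f"{encoded}.{signature}"

def verify_token(token: str, secret: bytes) -> dict:
    """Return the decoded payload, or raise PermissionError on a bad signature."""
    encoded, _, signature = token.rpartition(".")
    expected = hmac.new(secret, encoded.encode("utf-8"), hashlib.sha256).hexdigest()
    # compare_digest runs in constant time, so it does not leak partial matches.
    matches = hmac.compare_digest(signature.encode("utf-8"), expected.encode("utf-8"))
    if not encoded or not matches:
        raise PermissionError("invalid license signature")
    return json.loads(base64.b64decode(encoded))

secret = b"demo-secret"  # hypothetical shared secret, for illustration only
token = make_token({"org": "example", "plan": "enterprise"}, secret)
print(verify_token(token, secret))
```

Signing the encoded payload rather than the raw JSON means the verifier never has to parse untrusted data before the signature has been checked.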

Programmatic Validation

from renorm.auth import check_enterprise_license

check_enterprise_license()

Failure Modes

Condition	Behavior
Missing key	Raises PermissionError
Invalid signature	Raises PermissionError
Expired token	Raises TimeoutError

⚙️ Production Integration Pattern

Recommended structure in production pipelines:

import torch
from renorm import RenormTransformerLayer

def build_model():
    model = RenormTransformerLayer(dim=1024, heads=16)
    return model

def forward_pass(model, x):
    return model(x)

🧪 CI / Validation Test

Run a deterministic sanity check:

python -c "
import torch
from renorm import RenormTransformerLayer

torch.manual_seed(0)  # fix the RNG so the weights and input are reproducible
layer = RenormTransformerLayer(dim=256, heads=4)
x = torch.randn(2, 8, 256)
y = layer(x)

assert y.shape == x.shape
assert torch.isfinite(y).all()
print('OK')
"

📊 Performance Notes

renorm-native is optimized for:

- Stable forward/backward propagation under long sequence lengths
- Reduced numerical drift in deep stacks
- Consistent execution across heterogeneous compute backends

It is not intended as a raw speed-optimized kernel replacement for PyTorch primitives.

🔄 Compatibility Matrix

Environment	Status
CPU (Windows)	✅ Supported
CPU (Linux)	✅ Supported
CUDA 11+	✅ Supported
MPS (Apple Silicon)	⚠️ Experimental
Distributed training (DDP)	✅ Compatible

🧠 Design Philosophy

renorm-native prioritizes:

- Numerical correctness over raw speed
- Stability over aggressive optimization
- Cross-device consistency over hardware specialization

It is designed to behave predictably under:

- Gradient explosion conditions
- Low-precision arithmetic
- Fragmented tensor memory layouts

📦 Recommended Deployment (Docker)

FROM pytorch/pytorch:2.2.0-cuda11.8-cudnn8-runtime

WORKDIR /app

RUN pip install renorm-native

COPY . .

CMD ["python", "main.py"]

📈 Benchmark (Example Placeholder)

Layer	Stability Score	NaN Rate
torch.nn.LayerNorm	baseline	medium under stress
renorm-native	improved	near-zero

(The values above are placeholders, not measured results. Replace them with real measurements before publishing.)
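
A "NaN Rate" column like the one above could be filled in by counting how many trials produce a non-finite output. The sketch below is one possible harness in plain Python. `non_finite_rate` and `toy_trial` are hypothetical names, and the toy trial is a stand-in for a real forward pass, so the number it prints says nothing about renorm-native or torch.nn.LayerNorm.

```python
import math

def non_finite_rate(run_trial, trials: int) -> float:
    """Fraction of trials whose output contains a NaN or an infinity."""
    failures = 0
    for seed in range(trials):
        outputs = run_trial(seed)
        if any(not math.isfinite(v) for v in outputs):
            failures += 1
    return failures / trials

def toy_trial(seed: int):
    # Stand-in for a real forward pass: every fourth seed overflows to infinity.
    return [float("inf")] if seed % 4 == 0 else [1.0, 2.0]

print(non_finite_rate(toy_trial, trials=8))  # 0.25
```

In a real benchmark, `run_trial` would seed the framework, run the layer under the stress condition being tested, and return the flattened output values.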

🌐 Roadmap

Planned improvements:

- Distributed kernel optimization (multi-GPU aware routing)
- Expanded attention primitives
- Quantization-aware renormalization mode
- torch.compile integration

📩 Support

For production integration or enterprise deployment:

GitHub Issues: https://github.com/Tobi-Adesoye/renorm-native
Contact: Adesoyetobe@gmail.com
