Build RNN-Transformer hybrid models
rwkv-architecture · skill · setup · L3 · ★9,423
Orchestra-Research/AI-Research-SKILLs
What it does
Builds an RNN-Transformer hybrid model with the RWKV architecture, whose inference cost is O(n) in the sequence length n.
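To illustrate where the O(n) inference cost comes from, here is a minimal single-channel sketch of an RWKV-4-style weighted key-value (WKV) average. It compares a direct evaluation that re-sums the whole history at every step with a recurrent evaluation that carries only two running numbers. The function names and the scalar parameters `w` (decay) and `u` (current-token bonus) are illustrative assumptions, not the `rwkv` package API, and the usual numerical-stability trick (tracking a running maximum exponent) is left out for brevity.

```python
import math

def wkv_quadratic(k, v, w, u):
    """Direct evaluation: every step re-sums all earlier tokens, O(n^2) total."""
    out = []
    for t in range(len(k)):
        # The current token gets the extra bonus u instead of a decay.
        num = math.exp(u + k[t]) * v[t]
        den = math.exp(u + k[t])
        for i in range(t):
            # Earlier tokens are down-weighted by how far back they are.
            weight = math.exp(-(t - 1 - i) * w + k[i])
            num += weight * v[i]
            den += weight
        out.append(num / den)
    return out

def wkv_recurrent(k, v, w, u):
    """Same quantity from a two-number running state, O(n) total."""
    a, b = 0.0, 0.0          # decayed sums of exp(k)*v and exp(k) over the past
    decay = math.exp(-w)
    out = []
    for k_t, v_t in zip(k, v):
        bonus = math.exp(u + k_t)
        out.append((a + bonus * v_t) / (b + bonus))
        # Fold the current token into the state for the next step.
        a = decay * a + math.exp(k_t) * v_t
        b = decay * b + math.exp(k_t)
    return out

if __name__ == "__main__":
    keys = [0.1, -0.2, 0.3, 0.0]
    values = [1.0, 2.0, -1.0, 0.5]
    print(wkv_quadratic(keys, values, w=0.5, u=0.2))
    print(wkv_recurrent(keys, values, w=0.5, u=0.2))
```

Both functions should agree up to floating-point rounding. The recurrent form touches each token once and keeps a fixed-size state, which is the property the O(n) claim relies on.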
Best for
Combining RNN efficiency with Transformer quality when linear-time inference matters.
Inputs
- · model config
- · training data
- · inference input
Outputs
- · trained RWKV model
- · inference output
- · memory usage metrics
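As a rough way to reason about the memory usage metrics listed above, the following back-of-the-envelope sketch compares the number of cached elements a standard Transformer keeps during generation (a key and a value per token per layer) with a fixed-size recurrent state. The helper names are hypothetical, and `vectors_per_layer=5` is an assumption about an RWKV-4-style state layout; other versions may differ.

```python
def kv_cache_elements(n_layers, d_model, seq_len):
    """Elements in a Transformer KV cache: keys + values, per layer, per token."""
    return 2 * n_layers * seq_len * d_model

def recurrent_state_elements(n_layers, d_model, vectors_per_layer=5):
    """Elements in a fixed recurrent state (assumed: a few d_model vectors per layer)."""
    return vectors_per_layer * n_layers * d_model

if __name__ == "__main__":
    n_layers, d_model = 24, 2048   # hypothetical model size
    for seq_len in (1_024, 8_192, 65_536):
        kv = kv_cache_elements(n_layers, d_model, seq_len)
        st = recurrent_state_elements(n_layers, d_model)
        print(f"seq_len={seq_len:>6}  kv_cache={kv:>14,}  recurrent_state={st:,}")
```

The KV cache grows linearly with the number of generated tokens, while the recurrent state does not depend on sequence length at all. Measured figures on real hardware also include weights and activations, so treat these counts as a lower bound only.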
Requires
- · rwkv
- · torch
- · transformers
Preconditions
- · GPU available
- · PyTorch 2.0+
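The two preconditions can be checked before any training or inference run. The sketch below is one possible way to do it; `check_preconditions` and `parse_major_minor` are hypothetical helper names, and "GPU available" is interpreted here as a CUDA device visible to PyTorch, which is an assumption.

```python
import importlib.util

def parse_major_minor(version):
    """Turn a version string such as '2.1.0+cu121' into (2, 1)."""
    core = version.split("+")[0]          # drop any local build suffix
    major, minor = core.split(".")[:2]
    return int(major), int(minor)

def check_preconditions():
    """Report whether torch is installed, is 2.0 or newer, and sees a CUDA GPU."""
    report = {
        "torch_installed": importlib.util.find_spec("torch") is not None,
        "torch_2_plus": False,
        "gpu_available": False,
    }
    if report["torch_installed"]:
        import torch
        report["torch_2_plus"] = parse_major_minor(str(torch.__version__)) >= (2, 0)
        report["gpu_available"] = torch.cuda.is_available()
    return report

if __name__ == "__main__":
    for name, ok in check_preconditions().items():
        print(f"{name}: {'ok' if ok else 'MISSING'}")
```

The function never raises when PyTorch is absent; it simply reports every check as failed so the caller can decide how to proceed.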
Failure modes
- · training can be slower than for a comparable Transformer
- · inference unstable if the model was trained with too high a learning rate
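One common way to reduce the risk from an overly high learning rate is to ramp it up gradually and then decay it. The sketch below shows a linear warmup followed by exponential decay in plain Python. The function name and every default value are hypothetical placeholders, not settings taken from this skill or from the RWKV training code.

```python
def lr_at_step(step, peak_lr=6e-4, final_lr=1e-5,
               warmup_steps=1_000, total_steps=100_000):
    """Linear warmup to peak_lr, then exponential decay towards final_lr."""
    if step < warmup_steps:
        # Ramp up from peak_lr / warmup_steps to peak_lr.
        return peak_lr * (step + 1) / warmup_steps
    # Fraction of the decay phase completed, clamped to [0, 1].
    progress = min(1.0, (step - warmup_steps) / max(1, total_steps - warmup_steps))
    return peak_lr * (final_lr / peak_lr) ** progress

if __name__ == "__main__":
    for s in (0, 500, 999, 1_000, 50_000, 100_000):
        print(s, lr_at_step(s))
```

In a PyTorch training loop the returned value would typically be written into each optimizer parameter group before the step. If loss spikes appear early in training, lowering `peak_lr` or lengthening the warmup is a reasonable first thing to try.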
Trust signals
- · O(n) inference complexity in sequence length n (proven)
- · quality comparable to Transformers on benchmarks