Build RNN-Transformer hybrid models

rwkv-architecture · skill · setup · L39,423
Orchestra-Research/AI-Research-SKILLs
What it does

Builds an RNN-Transformer hybrid with O(n) inference using the RWKV architecture.
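
The O(n) inference claim can be illustrated with a minimal single-channel sketch of an RWKV-4 style WKV recurrence. This is not the `rwkv` package's code. The function names and the scalar parameters `w` (decay) and `u` (bonus for the current token) are assumptions chosen for illustration. The first function re-sums over all earlier positions for every token, and the second carries two running sums, so each new token costs constant work.

```python
import math

def wkv_quadratic(k, v, w, u):
    """Reference form: for each position t, re-sum over all earlier positions (O(n^2))."""
    out = []
    for t in range(len(k)):
        # The current token gets the bonus u instead of a decay.
        num = math.exp(u + k[t]) * v[t]
        den = math.exp(u + k[t])
        for i in range(t):
            # Earlier tokens are weighted by their key and decayed by distance.
            weight = math.exp(-(t - 1 - i) * w + k[i])
            num += weight * v[i]
            den += weight
        out.append(num / den)
    return out

def wkv_recurrent(k, v, w, u):
    """Recurrent form: two running sums (a, b) give O(n) time and O(1) state."""
    a, b = 0.0, 0.0           # decayed sums of exp(k_i) * v_i and exp(k_i)
    decay = math.exp(-w)
    out = []
    for kt, vt in zip(k, v):
        bonus = math.exp(u + kt)
        out.append((a + bonus * vt) / (b + bonus))
        # Fold the current token into the state for the next step.
        a = decay * a + math.exp(kt) * vt
        b = decay * b + math.exp(kt)
    return out

# Hypothetical toy inputs: keys, values, decay and bonus for one channel.
keys = [0.1, -0.3, 0.5, 0.2]
values = [1.0, 2.0, -1.0, 0.5]
slow = wkv_quadratic(keys, values, w=0.5, u=0.3)
fast = wkv_recurrent(keys, values, w=0.5, u=0.3)
print(all(math.isclose(s, f) for s, f in zip(slow, fast)))
```

Both forms compute the same outputs. The sketch uses plain exponentials for readability, so it can overflow for large keys; practical implementations typically track a running maximum exponent to stay numerically stable.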

Best for

Combining RNN efficiency with Transformer quality when linear-time inference matters.

Inputs
  • model config
  • training data
  • inference input
Outputs
  • trained RWKV model
  • inference output
  • memory usage metrics
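
As a rough illustration of what a memory usage metric might compare, the sketch below estimates inference-time memory for a Transformer key/value cache against a fixed-size recurrent state. Every number in it is an assumption: the helper names are hypothetical, the model dimensions are made up, values are assumed to be 2 bytes (fp16), and the figure of five state vectors per layer assumes an RWKV-4 style state. Actual measurements on a trained model will differ.

```python
def kv_cache_bytes(n_layers, d_model, n_tokens, bytes_per_value=2):
    """Transformer KV cache: one key and one value vector per token per layer."""
    return 2 * n_layers * n_tokens * d_model * bytes_per_value

def recurrent_state_bytes(n_layers, d_model, vectors_per_layer=5, bytes_per_value=2):
    """Recurrent state: a fixed number of d_model-sized vectors per layer.

    vectors_per_layer=5 is an assumption (RWKV-4 style state).
    """
    return n_layers * vectors_per_layer * d_model * bytes_per_value

# Hypothetical model size: 24 layers, hidden width 2048.
for n_tokens in (1_024, 8_192, 65_536):
    kv = kv_cache_bytes(24, 2048, n_tokens)
    state = recurrent_state_bytes(24, 2048)
    print(f"{n_tokens:>6} tokens: KV cache {kv / 2**20:.1f} MiB, "
          f"recurrent state {state / 2**20:.2f} MiB")
```

The cache estimate grows linearly with context length, while the recurrent state estimate does not depend on it.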
Requires
  • rwkv
  • torch
  • transformers
Preconditions
  • GPU available
  • PyTorch 2.0+
Failure modes
  • training can be slower than comparable Transformers
  • training unstable if learning rate is too high
Trust signals
  • O(n) inference time in sequence length, which follows from the recurrent formulation
  • reported quality comparable to similarly sized Transformers on benchmarks