This paper introduces text-to-text regression as a novel approach to predicting the performance of large-scale industrial systems, like Google's Borg compute cluster. Unlike traditional tabular methods that struggle with complex, non-tabular data such as configuration files and system logs, this method utilizes encoder-decoder Regression Language Models (RLMs). The research demonstrates that these RLMs can achieve high accuracy (up to 0.99 rank correlation), adapt efficiently to new tasks with minimal new data, and accurately capture the densities of complex outcome distributions. The findings highlight the importance of observing comprehensive features, extensive pretraining for transfer learning, and the model's inherent uncertainty quantification, paving the way for more universal system simulators.
資訊
- 節目
- 頻率每週更新
- 發佈時間2025年8月30日 下午6:00 [UTC]
- 長度16 分鐘
- 年齡分級兒少適宜