Filtered not mixed: filtering-based online gating for mixture of large language models

Saqur, R; Kratsios, A; Krach, F; Limmer, Y; Tian, J-J; Willes, J; Horvath, B; Rudzicz, F

AI Collection

Conference item

Filtered not mixed: filtering-based online gating for mixture of large language models

Abstract:: We propose MoE-F - a formalized mechanism for combining N pre-trained expert Large Language Models (LLMs) in online time-series prediction tasks. MoE-F adaptively forecasts the optimal weighting of LLM predictions at each time step by leveraging the conditional information in each expert's running performance, enabling the best combination of experts for the next step prediction. Diverging from static (learned) Mixture of Experts (MoE) methods, our approach employs time-adaptive stochastic filtering techniques to combine experts. By framing the expert selection problem as a finite state-space, continuous-time Hidden Markov model (HMM), we can leverage the Wonham-Shiryaev filter. Our approach first constructs N parallel filters corresponding to each N individual LLMs. Each filter proposes its best combination of LLMs, given the information that they have access to. Subsequently, the N filter outputs are optimally aggregated to maximize their robust predictive power, and this update is computed efficiently via a closed-form expression, thus generating our ensemble predictor. Our contributions are: (I) the MoE-F algorithm - deployable as a plug-and-play filtering harness over any heterogenous mixture of LLMs or specialized models, (II) theoretical optimality guarantees of the proposed filtering-based gating algorithm (via optimality guarantees for its parallel Bayesian filtering and its robust aggregation steps), and (III) empirical evaluation and ablative results using state of the art foundational and MoE LLMs on a real-world Financial Market Movement task based on streaming news where MoE-F attains a 17% absolute and 48.5% relative F1-score improvement over the best performing individual LLM expert. Further, we provide empirical evidence of substantial performance gains with MoE-F over specialized models in the long-horizon time-series forecasting domain using electricity-grid datasets. Supplementary materials available at: https://github.com/raeidsaqur/moe-f.

Publication status:: Published

Peer review status:: Peer reviewed

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Saqur, R., Kratsios, A., Krach, F., Limmer, Y., Tian, J.-J., Willes, J., Horvath, B., & Rudzicz, F. (2025). Filtered not mixed: filtering-based online gating for mixture of large language models. 13th International Conference on Learning Representations (ICLR 2025).

MLA Style

Saqur, R, et al. “Filtered Not Mixed: Filtering-Based Online Gating for Mixture of Large Language Models.” 13th International Conference on Learning Representations (ICLR 2025), 2025.

Chicago Style

Saqur, R, A Kratsios, F Krach, Y Limmer, J-J Tian, J Willes, B Horvath, and F Rudzicz. 2025. “Filtered Not Mixed: Filtering-Based Online Gating for Mixture of Large Language Models.” In 13th International Conference on Learning Representations (ICLR 2025). OpenReview.
Print

Access Document

Files:: Saqur_et_al_2025_Filtered_not_mixed.pdf

(Preview, Version of record, pdf, 853.6KB, Terms of use)

Publication website:: https://openreview.net/forum?id=ecIvumCyAj

Authors

+ Saqur, R More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Mathematical Institute
Role:: Author
ORCID:: 0000-0002-6330-5480

+ Kratsios, A More by this author

Role:: Author

+ Krach, F More by this author

Role:: Author

+ Limmer, Y More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Mathematical Institute
Role:: Author
ORCID:: 0000-0002-8418-7284

+ Tian, J-J More by this author

Role:: Author

More authors...

+ Canadian Institute for Advanced Research More from this funder

Funder identifier:: https://ror.org/01sdtdd95
Grant:: RGPIN-2023-04482

+ Natural Sciences and Engineering Research Council of Canada More from this funder

Funder identifier:: https://ror.org/01h531d29

Publisher:: OpenReview
Host title:: Proceedings of the 13th International Conference on Learning Representations (ICLR 2025)
Article number:: 7448
Publication date:: 2025-01-22
Acceptance date:: 2025-01-22
Event title:: 13th International Conference on Learning Representations (ICLR 2025)
Event location:: Singapore
Event website:: https://iclr.cc/Conferences/2025
Event start date:: 2025-04-24
Event end date:: 2025-04-28

Language:: English
Pubs id:: 2282226
Local pid:: pubs:2282226
Deposit date:: 2026-04-02
ARK identifier:: ark:/29072/ora_2ead59d6e0324775afd8ffe5a8aa031e

Terms of use

Copyright holder:: Saqur et al.

Licence:: CC Attribution (CC BY)

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

Conference item

Filtered not mixed: filtering-based online gating for mixture of large language models

Actions

Access Document

Authors

Terms of use

Views and Downloads

Altmetrics

Dimensions

Conference item

Filtered not mixed: filtering-based online gating for mixture of large language models

Actions

Access Document

Authors

Funding

Bibliographic Details

Item Description

Terms of use

Metrics

Views and Downloads

Altmetrics

Dimensions