Conceptual Approaches to Organizing Feature Stores in High-Load Machine Learning Systems
Keywords:
feature store, MLOps, high-load systems, online inference, offline training, point-in-time correctness, feature freshness, data consistency, geo-distribution, maintenance schedulingAbstract
The paper examines conceptual approaches to organizing feature stores in high-load machine learning systems. The growth of production ML has exposed persistent problems, including duplicated feature logic, training–serving skew, and brittle pipelines, particularly under low-latency inference and heavy batch training workloads. This paper proposes an original analytical framework for organizing feature stores under high-load conditions. The framework links internal architectural decisions (registry design, transformation semantics, offline/online materialization, synchronization, geo-distribution, and maintenance control) to operational objectives (latency, point-in-time correctness, reproducibility, governance, and cost discipline), enabling structured design selection rather than a descriptive survey of existing systems. The study characterizes functional blocks of a feature store (feature registry, transformation layer, offline and online materializations, and serving interfaces), then analyzes consistency guarantees, point-in-time correctness, refresh strategies, and geo-distribution patterns. Methods rely on a comparative analysis of recent systems and surveys, with a focus on scalability bottlenecks in joins, cache design, and maintenance scheduling. The conclusion formulates design recommendations for selecting storage layouts, synchronization policies, and operational controls across the ML lifecycle. The paper targets engineers and researchers working on production MLOps and data infrastructure. Implications for feature reuse and compliance are discussed.
References
[1] Berberi, L., Kozlov, V. I., Nguyen, G., Sáinz-Pardo Díaz, J., Calatrava, A., Moltó, G., Tran, V., & López García, Á. (2025). Machine learning operations landscape: platforms and tools. Artificial Intelligence Review, 58, Article 167. https://doi.org/10.1007/s10462-025-11164-3
[2] de la Rúa Martínez, J., Buso, F., Kouzoupis, A., Ormenisan, A. A., Niazi, S., Bzhalava, D., Mak, K., Jouffrey, V., Ronström, M., Cunningham, R., Zangis, R., Mukhedkar, D., Khazanchi, A., Vlassov, V., & Dowling, J. (2024). The Hopsworks Feature Store for Machine Learning. In Companion of the 2024 International Conference on Management of Data (SIGMOD/PODS ’24) (pp. 135–147). ACM. https://doi.org/10.1145/3626246.3653389
[3] Fang, Z., Yi, Y., Zhang, J., Liu, Y., Mu, Y., Lu, Q., Xu, X., Wang, J., Wang, C., Zhang, S., & Chen, S. (2023). MLOps spanning whole machine learning life cycle: A survey (arXiv:2304.07296). arXiv.
[4] Hewage, N., & Meedeniya, D. (2022). Machine learning operations: A survey on MLOps tool support (arXiv:2202.10169). arXiv.
[5] Kreuzberger, D., Kühl, N., & Hirschl, S. (2022). Machine learning operations (MLOps): Overview, definition, and architecture (arXiv:2205.02302). arXiv
[6] Li, A., Ranganathan, B., Pan, F., Zhang, M., Xu, Q., Li, R., Raman, S., Shah, S. P., & Tang, V. (2023). Managed geo-distributed feature store: Architecture and system design (arXiv:2305.20077). arXiv.
[7] Liu, R., Park, K., Psallidas, F., Zhu, X., Mo, J., Sen, R., Interlandi, M., Karanasos, K., Tian, Y., & Camacho-Rodríguez, J. (2023). Optimizing data pipelines for machine learning in feature stores. Proceedings of the VLDB Endowment, 16, 4230–4239. https://doi.org/10.14778/3625054.3625060
[8] Orr, L., Sanyal, A., Ling, X., Goel, K., & Leszczynski, M. (2021). Managing ML pipelines: Feature stores and the coming wave of embedding ecosystems (arXiv:2108.05053). arXiv.
[9] Wooders, S., Mo, X., Narang, A., Lin, K., Stoica, I., Hellerstein, J. M., Crooks, N., & Gonzalez, J. E. (2023). RALF: Accuracy-aware scheduling for feature store maintenance. Proceedings of the VLDB Endowment, 17(3), 563–576. https://doi.org/10.14778/3632093.3632116
[10] Zarour, M., Alzabut, H., & Al-Sarayreh, K. T. (2025). MLOps best practices, challenges and maturity models: A systematic literature review. Information and Software Technology, 183, 107733. https://doi.org/10.1016/j.infsof.2025.107733
[11] Kovalchuk A. A comprehensive model of business consulting for small and medium-sized enterprises. - K.: Vidavnichy house "Internauka", 2025. - 98 p.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Artem Korkhov

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Authors who submit papers with this journal agree to the following terms.