Automated Classification of Large-Scale Network Configurations Using Machine Learning and Semantic Vectorization

Uncategorized

Authors: Narendra Reddy Burramukku

Abstract: The rapid expansion of large-scale computer networks has introduced significant complexity in managing diverse network configurations. Manual classification and analysis of configurations are time-consuming, error-prone, and increasingly infeasible in dynamic environments. This paper presents a novel framework for automated classification of large-scale network configurations using machine learning combined with semantic vectorization. Network configuration files are first pre-processed and transformed into high-dimensional vector representations that capture both semantic and hierarchical relationships among configuration commands, protocols, and policies. These embeddings serve as input to supervised machine learning models, including Random Forest, Support Vector Machines, and Neural Networks, enabling accurate classification of network devices, roles, and compliance profiles. Experiments are conducted on real-world enterprise, cloud, and synthetic network datasets, comprising thousands of configuration files with diverse structures and device types. Results demonstrate that the proposed framework significantly outperforms traditional rule-based and feature-based approaches, achieving up to 94.5% F1-score with graph-based embeddings. Scalability analysis indicates the method can efficiently handle large volumes of configurations while maintaining high accuracy. The study highlights the effectiveness of semantic vectorization in capturing complex configuration semantics and facilitating robust automated classification. This framework provides a foundation for intelligent, scalable network management, supporting proactive policy enforcement, misconfiguration detection, and operational efficiency. Future work explores real-time classification, integration with network orchestration systems, and transformer-based embeddings for richer semantic representation.

DOI: https://doi.org/10.5281/zenodo.18383730

 

× How can I help you?