IEEE 2019-2020 Machine Learning Projects in Java

Cloud network monitoring data is dynamic and distributed. Signals to monitor the cloud can appear, disappear or change their importance and clarity over time. Machine learning (ML) models tuned to a given data set can therefore quickly become inadequate. A model might be highly accurate at one point in time but may lose its accuracy at a later time due to changes in input data and their features. Distributed learning with dynamic model selection is therefore often required. Under such selection, poorly performing models (although aggressively tuned for the prior data) are retired or put on standby while new or standby models are brought in. The well-known method of Ensemble ML (EML) may potentially be applied to improve the overall accuracy of a family of ML models. Unfortunately, EML has several disadvantages, including the need for continuous training, excessive computational resources, requirement for large training datasets, high risks of overfitting, and a time-consuming model-building process. In this paper, we propose a novel cloud methodology for automatic ML model selection and tuning that automates model building and selection and is competitive with existing methods. We use unsupervised learning to better explore the data space before the generation of targeted supervised learning models in an automated fashion. In particular, we create a Cloud DevOps architecture for autotuning and selection based on container orchestration and messaging between containers, and take advantage of a new autoscaling method to dynamically create and evaluate instantiations of ML algorithms. The proposed methodology and tool are demonstrated on cloud network security datasets.
Intelligent networks are regarded as existing networks incorporating some intelligent mechanisms such as cognitive and cooperative approaches to improve network performance. Security is highly essential in intelligent networks but has received less attention so far. In this article, we propose a framework that enables a secure intelligent network with the assistance of cloud-assisted privacy-preserving machine learning. In the framework, the cloud server can first generate a model using outsourced machine learning algorithms and then process testing data from the network with the generated model in real time, which reflects to the network and makes it more intelligent. At the same time, the proposal guarantees the security and privacy of both the training data and the testing data in the sense that the proposed framework takes advantage of differential privacy to perform privacy-preserving data analysis and homomorphic encryption to conduct valid operations over encrypted data. The performance evaluations of the core primitives employed in the framework including differential privacy and homomorphic encryption algorithms demonstrate the practicability of our proposal.
An industrial mobile network is crucial for industrial production in the Internet of Things. It guarantees the normal function of machines and the normalization of industrial production. However, this characteristic can be utilized by spammers to attack others and influence industrial production. Users who only share spams, such as links to viruses and advertisements, are called spammers. With the growth of mobile network membership, spammers have organized into groups for the purpose of benefit maximization, which has caused confusion and heavy losses to industrial production. It is difficult to distinguish spammers from normal users owing to the characteristics of multidimensional data. To address this problem, this paper proposes a spammer identification scheme based on Gaussian mixture model (SIGMM) that utilizes machine learning for industrial mobile networks. It provides intelligent identification of spammers without relying on flexible and unreliable relationships. SIGMM combines the presentation of data, where each user node is classified into one class in the construction process of the model. We validate the SIGMM by comparing it with the reality mining algorithm and hybrid fuzzy c-means (FCM) clustering algorithm using a mobile network dataset from a cloud server. Simulation results show that SIGMM outperforms these previous schemes in terms of recall, precision, and time complexity.
Social media networks have shown rapid growth in the past, and massive social data are generated which can reveal behavior or emotion propensities of users. Numerous social researchers leverage machine learning technology to build social media analytic models which can detect the abnormal behaviors or mental illnesses from the social media data effectively. Although the researchers only public the prediction interfaces of the machine learning models, in general, these interfaces may leak information about the individual data records on which the models were trained. Knowing a certain user's social media record was used to train a model can breach user privacy. In this paper, we present SocInf and focus on the fundamental problem known as membership inference. The key idea of SocInf is to construct a mimic model which has a similar prediction behavior with the public model, and then we can disclose the prediction differences between the training and testing data set by abusing the mimic model. With elaborated analytics on the predictions of the mimic model, SocInf can thus infer whether a given record is in the victim model's training set or not. We empirically evaluate the attack performance of SocInf on machine learning models trained by Xgboost, logistics, and online cloud platform. Using the realistic data, the experiment results show that SocInf can achieve an inference accuracy and precision of 73% and 84%, respectively, in average, and of 83% and 91% at best.
Semi-supervised learning (SSL) based on manifold regularization (MR) is an excellent learning framework. However, the performance of SSL heavily depends on the construction of manifold graph and the safety degrees of unlabeled samples. Due to the construction of manifold graph and safety degrees of unlabeled samples are usually pre-construct before classification and fixed during the classification learning process, which results independent with the subsequent classification. Aiming at the above problems, we propose a unified adaptive safe semi-supervised learning (Adap-SaSSL) framework. This framework adaptively constructs a manifold graph while adaptively calculating the safety degrees of unlabeled samples. Specifically, the weights of manifold graph and its parameters, as well as the safety degrees of unlabeled samples will be optimized during the learning process rather than being calculated in advance. Finally, we then develop and implement a adaptive safe classification method based on the Adap-SaSSL framework, which is called adaptive safe semi-supervised extreme learning machine (AdSafe-SSELM). Experimental results on artificial, benchmark and image datasets show that the performance of AdSafe-SSELM is effective and reliable compared to other algorithms.