Using Machine Learning for Anomaly Pattern Recognition in Manufacturing Processes
As the manufacturing sector is under constant pressure to satisfy customers’ demands in a competitive market by applying complex processes to meet manufacturing cost and schedule goals, the need to identify quality variables within processes is occurring at a faster rate. Locating the source of process variations becomes more challenging for engineers. Each day, the manufacturing sector generates tremendous amounts of data that provide valuable information. This data is crucial to supporting strategic business operations decision-making. Traditional ways of data interpretation are labor intensive and time consuming. Failure to accurately and precisely translate data will lead to subjective “opinion” or “speculation-based” decision-making.
In this paper, we will review general opportunities for the application of machine learning (ML) algorithms and methods to the test data troubleshooting process. A method is developed for analyzing data and identifying patterns that are consistent with poorly performing units. This method uses a “quasi-supervised” learning technique to identify drivers of variance within a dataset, visualize the trends among the primary drivers of variance, and establish some screening limits based on those trends. The method employs Principal Components Analysis (PCA) to review patterns, trends, and uses some knowledge of better or worse performing groups. The output is a set of screening limits that characterize parts likely to have similar performance. The method provides clear knowledge, visualization, and understanding of the trends that are driving failures or poor performers.In addition, it does not require the rigorous data capture that a true supervised learning method. This method can be used on any dataset with observations in the rows and attributes/variables in the columns if there is some knowledge of an identifiable batch that is better or worse than the others. A performance characterization on a batch of units was successfully performed to identify the anomalies within a dataset.