Austenitic Stainless Steel grade 304L and 316L are very important alloys used in various high temperature applications, which make it important to study their mechanical properties at elevated ...temperatures. In this work, the mechanical properties such as ultimate tensile strength (UTS), yield strength (YS), % elongation, strain hardening exponent (n) and strength coefficient (K) are evaluated based on the experimental data obtained from the uniaxial isothermal tensile tests performed at an interval of 50°C from 50°C to 650°C and at three different strain rates (0.0001, 0.001 and 0.01s−1). Artificial Neural Networks (ANN) are trained to predict these mechanical properties. The trained ANN model gives an excellent correlation coefficient and the error values are also significantly low, which represents a good accuracy of the model. The accuracy of the developed ANN model also conforms to the results of mean paired t-test, F-test and Levene's test.
In distributed machine learning, where agents collaboratively learn from diverse private data sets, there is a fundamental tension between
consensus
and
optimality
. In this paper, we build on recent ...algorithmic progresses in distributed deep learning to explore various consensus-optimality trade-offs over a fixed communication topology. First, we propose the
incremental consensus
-based distributed stochastic gradient descent (i-CDSGD) algorithm, which involves multiple consensus steps (where each agent communicates information with its neighbors) within each SGD iteration. Second, we propose the
generalized consensus
-based distributed SGD (g-CDSGD) algorithm that enables us to navigate the full spectrum from complete consensus (all agents agree) to complete disagreement (each agent converges to individual model parameters). We analytically establish convergence of the proposed algorithms for strongly convex and nonconvex objective functions; we also analyze the momentum variants of the algorithms for the strongly convex case. We support our algorithms via numerical experiments, and demonstrate significant improvements over existing methods for collaborative deep learning.
Plants encounter a variety of beneficial and harmful insects during their growth cycle. Accurate identification (i.e., detecting insects' presence) and classification (i.e., determining the type or ...class) of these insect species is critical for implementing prompt and suitable mitigation strategies. Such timely actions carry substantial economic and environmental implications. Deep learning-based approaches have produced models with good insect classification accuracy. Researchers aim to implement identification and classification models in agriculture, facing challenges when input images markedly deviate from the training distribution (e.g., images like vehicles, humans, or a blurred image or insect class that is not yet trained on). Out-of-distribution (OOD) detection algorithms provide an exciting avenue to overcome these challenges as they ensure that a model abstains from making incorrect classification predictions on images that belong to non-insect and/or untrained insect classes. As far as we know, no prior in-depth exploration has been conducted on the role of the OOD detection algorithms in addressing agricultural issues. Here, we generate and evaluate the performance of state-of-the-art OOD algorithms on insect detection classifiers. These algorithms represent a diversity of methods for addressing an OOD problem. Specifically, we focus on extrusive algorithms, i.e., algorithms that wrap around a well-trained classifier without the need for additional co-training. We compared three OOD detection algorithms: (a) maximum softmax probability, which uses the softmax value as a confidence score; (b) Mahalanobis distance (MAH)-based algorithm, which uses a generative classification approach; and (c) energy-based algorithm, which maps the input data to a scalar value, called energy. We performed an extensive series of evaluations of these OOD algorithms across three performance axes: (a) Base model accuracy: How does the accuracy of the classifier impact OOD performance? (b) How does the level of dissimilarity to the domain impact OOD performance? (c) Data imbalance: How sensitive is OOD performance to the imbalance in per-class sample size? Evaluating OOD algorithms across these performance axes provides practical guidelines to ensure the robust performance of well-trained models in the wild, which is a key consideration for agricultural applications. Based on this analysis, we proposed the most effective OOD algorithm as wrapper for the insect classifier with highest accuracy. We presented the results of its OOD detection performance in the paper. Our results indicate that OOD detection algorithms can significantly enhance user trust in insect pest classification by abstaining classification under uncertain conditions.
In this paper, to predict flow stress of Austenitic Stainless Steel (ASS) 304 at elevated temperatures the extended Rusinek–Klepaczko (RK) model has been modified using an exponential strain ...dependent term for dynamic strain aging (DSA) region. Isothermal tensile tests are conducted on ASS 304 for a temperature range of 323–923K with an interval of 50K and at strain rates of 0.0001s−1, 0.001s−1, 0.01s−1 and 0.1s−1. DSA phenomenon is observed from 623 to 923K at 0.0001s−1, 0.001s−1 and 0.01s−1. Material constants are calculated using data obtained from these tensile tests for non-DSA and DSA region separately. The predicted results from the RK model are compared with the experimental data to check the accuracy of the constitutive relation. It is observed that to find out the constants of this model, some initial assumptions are required, and these initial values affect the predicted values. Hence, Genetic Algorithm (GA) is used to optimize the constants for RK model. Statistical measures such as the correlation coefficient, the average absolute error and standard deviation are used to measure the accuracy of the model. The resulting values of the correlation coefficient for ASS 304 for non-DSA and DSA region using modified extended RK model are 0.9828 and 0.9701. This modified, extended RK model is compared with Johnson–Cook (JC), Zerilli–Armstrong (ZA) and Arrhenius models and it is observed that specifically in DSA region, the modified extended RK model gives highly accurate predictions.
Topology optimization has emerged as a popular approach to refine a component’s design and increase its performance. However, current state-of-the-art topology optimization frameworks are ...compute-intensive, mainly due to multiple finite element analysis iterations required to evaluate the component’s performance during the optimization process. Recently, machine learning (ML)-based topology optimization methods have been explored by researchers to alleviate this issue. However, previous ML approaches have mainly been demonstrated on simple two-dimensional applications with low-resolution geometry. Further, current methods are based on a single ML model for end-to-end prediction, which requires a large dataset for training. These challenges make it non-trivial to extend current approaches to higher resolutions. In this paper, we develop deep learning-based frameworks consistent with traditional topology optimization algorithms for 3D topology optimization with a reasonably fine (high) resolution. We achieve this by training multiple networks, each learning a different step of the overall topology optimization methodology, making the framework more consistent with the topology optimization algorithm. We demonstrate the application of our framework on both 2D and 3D geometries. The results show that our approach predicts the final optimized design better (5.76× reduction in total compliance MSE in 2D; 2.03× reduction in total compliance MSE in 3D) than current ML-based topology optimization methods.
•A deep learning framework consistent with SIMP algorithm for topology optimization.•Framework scalable to high resolution in 3D structural topology optimization.•Uses intermediate density and compliance to improve the final topology prediction.•Framework results validated using ground truth SIMP-based topology optimization.•More than 5x reduction in error compared to a single density-based ML model.
Decentralized deep learning algorithms leverage peer-to-peer communication of model parameters and/or gradients over communication graphs among the learning agents with access to their private data ...sets. The majority of the studies in this area focus on achieving high accuracy, with many at the expense of increased communication overhead among the agents. However, large peer-to-peer communication overhead often becomes a practical challenge, especially in harsh environments such as for an underwater sensor network. In this paper, we aim to reduce communication overhead while achieving similar performance as the state-of-the-art algorithms. To achieve this, we use the concept of Minimum Connected Dominating Set from graph theory that is applied in ad hoc wireless networks to address communication overhead issues. Specifically, we propose a new decentralized deep learning algorithm called minimum connected Dominating Set Model Aggregation (DSMA). We investigate the efficacy of our method for different communication graph topologies with a small to large number of agents using varied neural network model architectures. Empirical results on benchmark data sets show a significant (up to 100X) reduction in communication time while preserving the accuracy or in some cases, increasing it compared to the state-of-the-art methods. We also present an analysis to show the convergence of our proposed algorithm.
•Proposing a communication-efficient protocol for decentralized learning systems.•Finding valuable connections in the network to reduce communication overhead.•Hybrid of federated learning and gossiping methods.•Verifying the method performance for different networks, datasets, and models.•Preserving accuracy while reducing communication, even in large-scale networks.
Recent advances in generative modeling, namely Diffusion models, have revolutionized generative modeling, enabling high-quality image generation tailored to user needs. This paper proposes a ...framework for the generative design of structural components. Specifically, we employ a Latent Diffusion model to generate potential designs of a component that can satisfy a set of problem-specific loading conditions. One of the distinct advantages our approach offers over other generative approaches is the editing of existing designs. We train our model using a dataset of geometries obtained from structural topology optimization utilizing the SIMP algorithm. Consequently, our framework generates inherently near-optimal designs. Our work presents quantitative results that support the structural performance of the generated designs and the variability in potential candidate designs. Furthermore, we provide evidence of the scalability of our framework by operating over voxel domains with resolutions varying from 323 to 1283. Our framework can be used as a starting point for generating novel near-optimal designs similar to topology-optimized designs.
•Latent diffusion model for generating 3D structural component designs.•Framework for generating component designs consistent with topology optimization.•Generated designs have similar (near-optimal) strain energy to SIMP designs.•Large scale 3D voxel dataset for structural topology optimization.
Bioprosthetic heart valves (BHVs) are commonly used as heart valve replacements but they are prone to fatigue failure; estimating their remaining life directly from medical images is difficult. ...Analyzing the valve performance can provide better guidance for personalized valve design. However, such analyses are often computationally intensive. In this work, we introduce the concept of deep learning (DL) based finite element analysis (DLFEA) to learn the deformation biomechanics of bioprosthetic aortic valves directly from simulations. The proposed DL framework can eliminate the time-consuming biomechanics simulations, while predicting valve deformations with the same fidelity. We present statistical results that demonstrate the high performance of the DLFEA framework and the applicability of the framework to predict bioprosthetic aortic valve deformations. With further development, such a tool can provide fast decision support for designing surgical bioprosthetic aortic valves. Ultimately, this framework could be extended to other BHVs and improve patient care.
Abstract
We have developed a differentiable programming framework for truncated hierarchical B-splines (THB-splines), which can be used for several applications in geometry modeling, such as surface ...fitting and deformable image registration, and can be easily integrated with geometric deep learning frameworks. Differentiable programming is a novel paradigm that enables an algorithm to be differentiated via automatic differentiation, i.e., using automatic differentiation to compute the derivatives of its outputs with respect to its inputs or parameters. Differentiable programming has been used extensively in machine learning for obtaining gradients required in optimization algorithms such as stochastic gradient descent (SGD). While incorporating differentiable programming with traditional functions is straightforward, it is challenging when the functions are complex, such as splines. In this work, we extend the differentiable programming paradigm to THB-splines. THB-splines offer an efficient approach for complex surface fitting by utilizing a hierarchical tensor structure of B-splines, enabling local adaptive refinement. However, this approach brings challenges, such as a larger computational overhead and the non-trivial implementation of automatic differentiation and parallel evaluation algorithms. We use custom kernel functions for GPU acceleration in forward and backward evaluation that are necessary for differentiable programming of THB-splines. Our approach not only improves computational efficiency but also significantly enhances the speed of surface evaluation compared to previous methods. Our differentiable THB-splines framework facilitates faster and more accurate surface modeling with local refinement, with several applications in CAD and isogeometric analysis.
Structural topology optimization is a compute-intensive process due to several iterations of simulations required to evaluate the performance of the component during optimization. Deep learning (DL) ...based approaches can address this challenge, but these methods were demonstrated mainly using 2D shapes and, at best, in low-resolution 3D geometries (typically 323). Further, due to non-manufacturable geometric features, the predicted optimal geometries from DL may not be manufacturable, even using additive manufacturing. In this paper, we develop a DL framework using a multigrid convolutional neural network (CNN) to generate high-resolution topology-optimized 3D geometries with additional checks on the manufacturability of the predicted shapes. Our framework predicts the final optimal topology using the initial strain energy (objective function of structural topology optimization) and target volume fraction (material fraction to be preserved after optimization) as input. We train the network using a multigrid approach, which enables topology optimization at 1283 resolution, which was previously computationally challenging. We first train the multigrid CNN at a lower resolution and then transfer the learned network to continue training at higher resolutions. We use a distributed deep learning framework on a GPU supercomputing cluster to further speed up the training time. Distributed DL significantly speeds up the training time by more than 4× while achieving similar model performance. Finally, we check the optimal geometries for manufacturability using fused deposition modeling (FDM)-specific manufacturability constraints. The large training dataset (>60,000 high-resolution topology optimization examples) will be released with the paper to enable further research on this topic.
•Multigrid approach to train the neural network at fine resolution of 1283.•Demonstrate the manufacturability by 3D printing several sample models predicted our framework.•A data-parallel distributed deep learning framework to accelerate the training process.•A comprehensive high-resolution data set consisting of more than 60k optimal shapes.