Since its inception, the field of robotics has drawn inspiration from nature. The main aspect of nature imitation in robotics is apparent in the design and structure of the robots’ bodies that aim to mimic biological systems. By looking at the knowledge gained through biomechanics studies, living creatures with mobile abilities are mainly classified into two groups based on their body structure: vertebrates and invertebrates. Vertebrates include fish, mammals, birds, amphibians, and reptiles; invertebrates include crustaceans (crab, lobster), echinoderms (starfish, sea urchin), coelenterates (jellyfish), arachnids, molluscs (octopus, squid), insects, and worms, among others
[17].
The challenge of building robotic systems with motion capabilities similar to those of these creatures lies in their body construction, which exhibits compliance ranging from only a few parts such as an elephant’s trunk or mammals’ organs, to completely soft and deformable bodies in the case of some invertebrates such as jellyfish. The main contributor to this compliance is the elastic nature of the building blocks of these bodies such as muscles, tendons, skin, tissues, and cartilages, as they are known for having low Young’s modulus (less than one gigapascal)
[18].
Some attempts have been made to mimic some of these animals using hard materials. However, due to the limited degrees of freedom offered by conventional rigid robots compared to the infinite degrees and redundancy of soft bodies, different structures with continuum deformations had to be implemented. In contrast to conventional non-redundant rigid robots, discrete hyper-redundant and hard continuum robots offer large to infinite degrees of freedom, which brings them closer to mimicking vertebrates’ motion
[19][20]. Common examples include tendon-driven continuum manipulators
[21][22][23]. One of the first underwater robots to employ a structure of discrete multiple rigid-link sections actuated by tendons is the RoboTuna robotic fish
[24]; The VCUUV prototype, inspired by RoboTuna, uses hydraulic actuation to drive an articulated tail
[25]. Other serial multi-joint biomimetic fish robots have been developed to imitate carangiform swimming
[26][27].
Despite providing more degrees of freedom than rigid robots, hard continuum robots still lack the shape adaptability offered by soft robots, which would help bring robots closer to their bioinspired creatures. The Compliant Robotic Tuna (CRT)
[28] is an example of a biomimetic fish robot having a servo-actuated compliant body and tail and is able to perform swimming maneuvers. The Soft Robotic Fish (SoFi)
[29] is a marine exploration robot capable of 3D swimming that imitates fish motion. It is driven by a soft fluidic actuator and has a buoyancy control unit for depth adjustment. Other marine creatures such as batoids were also mimicked, as in the case of the stingray robot with a soft silicone outer body and pectoral fins
[30].
3.1.2. Design Optimization
Even when taking inspiration from nature, designing soft robots with the desired mechanical behaviors that allow them to perform specific tasks presents another challenge. The complexity of such robotic systems, due to their unconventional components from materials to actuation, makes it hard to use currently known design and simulation tools to build soft robots [31][32]. Optimization techniques have been proposed to help automate the design process, and bridge the gap between simulation, fabrication, and the actual performance of soft robots. The general optimization framework can be summarized as choosing the design behavior to be optimized, such as crawling or grasping; identifying the design variables to be optimized, such as the material and the actuation; and defining the constraints of the system. The optimization process iteratively evaluates the design candidates using analysis tools and searches for the optimal design.
One approach uses evolutionary optimization algorithms to automate the design and manufacturing of freeform soft robots. This approach uses voxel-based dynamic simulation to evaluate the morphology and locomotion of the robot [33]. Voxels are soft cubic blocks with specific parameters, such as stiffness and Poisson’s ratio, that undergo volumetric change when forces are applied to them. Another voxel-based method aims to optimize the morphology to achieve adaptability using the property of criticality, which allows the robot to perform more diverse tasks [34].
Another conceptual design approach provides a spatial grammar to build soft robots and optimize their design for locomotion and actuation [35]. The spatial grammar generates sub-assemblies of interconnected balls based on a set of defined rules. The generated models are then evaluated and optimized in terms of locomotion abilities.
Performing design optimization for underwater soft robots is an even more challenging problem, as the effect of the environment on the robot’s morphology needs to be taken into account. DiffAqua [36], a computational design pipeline, relies on differentiable simulation to perform gradient-based optimization for the geometry and control of soft underwater swimmers. The benefits of exploiting the morphology of soft robots and optimizing it to simplify the control are further discussed in the upcoming modeling and control sections.
Fabricating and assessing these designs are also challenging processes due to the traditional manufacturing methods being unsuitable for these unconventional soft materials. Additive manufacturing (AM) is one of the impactful technologies that helped enable this process [37][38]. One approach is to use AM to only fabricate the mold that would be used to pour the soft material in them. A more hybrid approach takes advantage of AM techniques, such as the fused filament fabrication (FFF) method, in addition to molding techniques to fabricate and assemble complex soft robotic systems. The third approach is the total additive manufacturing (TAM) approach. It exploits all the benefits of AM to fabricate soft robots, whether by 3D printing multiple soft parts and assemble them, or manufacturing the complete soft robot as a whole part. Such advances in 3D printing techniques for soft materials increased the ability to produce and test different designs of soft robots and optimize their morphological and material parameters.
3.2. Actuation
The actuation of soft robots poses several challenges due to the large number of degrees of freedom resulting from the large deformation of the soft materials that constitute them, making them underactuated systems that are harder to control. In addition, most conventional robotics actuators, such as DC motors, are bulky and rigid, which contradicts the main reason for developing soft robots with high compliance. Nonetheless, some soft robots use servo motors and gear pumps for fluidic actuation, while others use more unconventional actuators such as smart actuators, chemical reactions, and stiffness modulation
[39].
One common actuation method is the use of tendon wires that are anchored at several points in the body of the soft material. These cables are driven by applying tension to them using electric motors such as servos, causing the connected soft material to deform, resulting in different motions or shape changes of the soft body. One example is the bioinspired octopus’s arm
[40] made of silicone that is driven using cables. It can perform crawling motion and grasping similar to actual octopus tentacles. The use of traditional motors provides a large actuation force, especially in underwater applications where a powerful enough thrust is needed for locomotion. The shape deformation can be approximately determined through the displacement of the anchoring points of the cables.
Fluidic Elastomer Actuators (FEAs) is another type of soft actuators that rely mainly on fluid pressure
[41]. The actuators are made from hyperelastic materials with embedded channels that expand due to the applied pressure. One of the early implementations is the Pneumatic Artificial Muscles (PAMs), most notably the McKibben artificial muscle actuator
[42][43], which is made from a flexible elastomer tube constrained by a reinforced fiber to limit its extension but allow it to expand when pressurized, providing considerable force. Other types of fluidic elastomers use various means of pressurization, including pneumatic sources using compressed air
[44][45], pressurized gas such as CO
2 [46][47][48], or chemical pressure generation
[49][50], as well as hydraulic sources
[51][52][53]. The multigait crawling robot
[44][46] has pneumatic actuators with a Pneu-Net (PN) architecture. The PNs are composed of a series of extensible chambers that inflate when pressurized and an inextensible layer that constrains the expansion of the chambers, causing the elastomer to bend. The geometrical parameters of the chambers and the constraining layer guide the deformation of the elastomer, affecting its bending and twisting motion. Underwater applications using fluidic elastomers include a biomimetic autonomous fish with a bidirectional pneumatic elastomer
[48], an extended version of the former fish using a hydraulically pressurized elastomer instead
[52], and an underwater crawling robot having bellow fluidic actuators as legs
[53]. The completely soft Octobot
[50] relies on totally soft microfluidic logic to control gas generation through chemical fuel decomposition, causing actuation. The use of fluidic actuators is advantageous for obtaining high material deformation and the ability to arrange actuators in an agonist-antagonist form, similar to muscle pairs. However, they are slow and have delayed response, and their pressurization units can be hard to embed inside soft robots.
Another actuation approach is the use of different types of smart materials. Smart materials are distinct in their response to external thermal or electric stimuli, causing deformation or stiffness change to the material. Electroactive polymers (EAPs) use electric stimuli to deform. Dielectric elastomer actuators (DEAs) are a type of EAPs that comprise two compliant electrodes that are compressed when high voltage is applied to them
[54]. Compression force can be used to induce motion
[55][56]. Another type of EAPs used for soft robots’ actuation is ionic polymer metal composite (IPMC). It is composed of Nafion polymer and electrodes. Applying voltage to the electrodes causes the polymer to deform due to the ionization process and the motion of ions between the two electrodes
[57][58][59]. Shape memory alloys (SMAs) are smart materials that react to heat stimuli. When applying high temperature to the SMA, it deforms into a certain shape and is restored to its original shape after heat is removed. The heat is usually provided through electrical heating using high voltage. SMAs are used as actuators in soft robotics, as they can be embedded to drive a soft material such as polydimethylsiloxane (PDMS)
[60][61].
The use of smart actuators is prominent in underwater robotics
[62] due to the favorable operating conditions for smart materials in water. In addition, smart materials can be directly embedded within the elastically deformable body of the robots, making them a good option for biomimetic applications. For example, biomimicry of jellyfish was implemented using DEAs
[56] and using SMAs in the case of Robojelly
[63]. Manta ray biomimetic robots were actuated using IPMCs
[57] as well as SMAs
[64]. A biomimetic crawling starfish used actuated legs made from embedded SMA wires cast in PDMS
[60]. Another group developed a soft robotic arm inspired by octopus tentacles using cables and SMA springs
[61]. The SMA springs help mimic the muscular hydrostat of the octopus’s arm by providing transversal contraction. Smart actuators provide an advantage in terms of their compact size and weight, and high actuation biomimicry resembling real fish swimming modes. However, they require high-voltage sources and are hard to control. The various soft robotic platforms are shown in
Table 1, classifying their biomimetic inspiration, actuation types, swimming modes, and level of compliance.
Table 1. Classification of various underwater soft robotic systems.
Reference |
Robot |
Biomimicry |
Actuation |
Swimming |
Compliance |
[26] |
Multi-Joint Fish |
Carangiform Fish |
Electric Actuators (Servomotors) |
BCF Undulation |
Medium |
[58][59] |
Biomimetic Fish |
Fish |
IPMC |
BCF/MPF Oscillation |
Medium |
[29][48][52] |
SoFi |
Fish |
FEA (Pneumatic/Hydraulic) |
BCF Undulation |
High |
[30] |
Stingray Robot |
Stingray |
Electric Actuators (Servomotors) |
MPF Undulation |
Medium |
[40] |
Octopus Arm |
Octopus |
Motor-driven Cables |
Crawling |
High |
[61] |
Octopus Arm |
Octopus |
Motor-Driven Cables/SMA Springs |
- |
High |
[65] |
Octopus Robot |
Octopus |
Motor-Driven Cables/SMA |
Crawling |
Medium |
[55] |
Cuttlefish Robot |
Cuttlefish |
DEA |
Jet Propulsion |
Medium |
[63] |
Robojelly |
Jellyfish |
SMA |
Propulsion |
High |
[50] |
Octobot |
Octopus |
FEA (Chemical Reaction) |
- |
High |
[53] |
Morphing Underwater Walking Robot |
- |
FEA (Hydraulic) |
Walking/Crawling |
Medium |
[56] |
Jellyfish-Inspired Soft Robot |
Jellyfish |
DEA |
Propulsion |
High |
[58] |
Robotic Manta Ray |
Manta Ray |
IPMC |
MPF Undulation |
Medium |
[64] |
Micro Biomimetic Manta Ray |
Manta Ray |
SMA |
MPF Undulation |
Medium |
[60] |
Starfish Robot |
Starfish |
SMA Wires |
Propulsion |
High |
[66] |
Starfish-Like Soft Robot |
Starfish |
SMA |
Crawling |
High |
[67] |
RoboScallop |
Scallop |
FEA |
Jet Propulsion |
Medium |
[68] |
Eel-like Robot |
Leptocephalus (Eel Larva) |
Fluid Electrode DEA (FEDEA) |
BCF Undulation |
High |
[69] |
Morphing Limb Amphibious Turtle Robot |
Turtle/Tortoise |
Variable Stiffness Material-pneumatic Actuators |
Drag-induced Swimming/Walking |
Medium |
[70] |
FinRay Robotic Jellyfish |
Jellyfish |
FinRay Actuators driven with Servomotors |
Propulsion |
Medium |
[71] |
PATRICK: Brittle Star-Inspired Soft Robot |
Brittle Star |
SMA Wires |
Crawling |
High |
[72] |
Soft Underwater Starfish |
Starfish |
Servo-driven Tendon Wires |
Propulsion |
High |
3.3. Modeling
The modeling and control phase is the most challenging part of building functional soft robotic systems capable of performing complex tasks and intelligently interacting with their own environment. All the well-established modeling techniques for rigid robots cannot be applied to soft robots due to their continuum property and their complex non-linear dynamics inherent from their elastic behavior. As conventional kinematic and dynamic modeling methods are inapplicable, new approaches for modeling and control of soft robots are being developed (
Figure 2).
Figure 2. An overview of the different modeling and control techniques used in soft robotics.
These modeling approaches can be separated into two main categories: model-based approaches and model-free data-driven methods. While the former relies on formulating either an exactly accurate or a simplified approximate model of a soft robot, the latter tries to implicitly learn the behavior of the robot using input and output data collected directly from the actual system.
The main goal of the modeling process is to map the soft robot’s actuation space to the configuration or task space. Since continuum soft robots are infinite-dimensional systems, formulating such models becomes highly difficult. Instead, the modeling methods rely on approximations and assumptions to reduce the system to a finite-dimensional one. The most commonly used simplification for kinematic modeling is the constant curvature (CC) method
[73]. It assumes that the soft robot has constant strain along its whole length
[74]. The piecewise constant curvature (PCC) method is an extended version of the CC that assumes the strain to be piecewise constant, with each segment of the soft structure having a constant strain
[75]. A further extension is the variable curvature (VC)
[76][77], which models each section as a CC. The PCC approximation is mainly valid for cable-driven soft manipulators. However, it cannot capture all the complex dynamics of soft structures, as it is a steady-state model. Nonetheless, it proves to be an accurate model for control for a lot of tendon-driven manipulators
[78][79][80][81]. One group also used a PCC kinematic model of a multi-segment pneumatic actuator to develop a dynamic motion controller
[82].
Numerical techniques have also been used for modeling soft robots. The Cosserat rod theory is one of these numerical methods
[83][84], accurately representing the tension, shearing, bending, and torsion of rods. A different numerical approach for modeling soft robots is the finite element method (FEM), which yields more accurate results but at the cost of high computational requirements
[85]. The method relies on discretizing the structure into a large number of nodes, called mesh, and iteratively solving the differential equations governing the behavior of these nodes, until the model converges. However, the use of FEM for real-time control is difficult, so control approaches based on real-time FEM have been proposed
[86][87][88]. An approach combining kinematic modeling using PCC and Denavit–Hartenberg (DH) parameters with FEM analysis was also used to model a soft pneumatic actuator (SPA)
[89]. The problem with the FEM is the high dimensional space of the obtained model. A common solution is to reduce the domain of the model in order to achieve high computational efficiency, without sacrificing accuracy
[90][91]. The reduced-order model can help with the development of low-order controllers and observers based on a linearized model of the system
[92][93]. One group developed a dynamic simulation tool for articulated soft robots based on numerical simulation methods for slender structures
[94]. Another method uses genetic algorithms for dynamic parameters estimation of an octopus-inspired robot
[95].
Controllers developed from the static kinematic models obtained using the described methods are considered static controllers that discard the underlying dynamics of the system. Developing high-order dynamic models for soft robots is difficult and computationally expensive for controllers. A first-order dynamic modeling approach is proposed to reduce the computational space without affecting the controller’s performance
[96].
The model-free methods for modeling soft robots mainly use data-driven machine learning and deep learning techniques to find a mapping between the inputs and outputs of the soft system
[97]. Input actuation signals and robot states can be obtained by sensors, either embedded or external visual tracking sensors. The data can then be used with different supervised learning, unsupervised learning, and reinforcement learning techniques to develop models and controllers for soft robots. Examples of commonly used techniques include feedforward neural network (FNN)
[98][99][100][101], recurrent neural network (RNN)
[102], convolutional neural network (CNN)
[103], and echo state network (ESN)
[98], based on the reservoir computing framework. FNNs are widely used for modeling soft systems. One approach is the use of an FNN to model the work envelope of an SPA with a variable inclination angle
[104]. Another group used FEM-generated training data to learn the kinematic model of a 3D motion SPA using an FNN
[100]. An RNN used sensory data from cPDMS resistive sensors and a load cell to predict the deformation and force models of a soft pneumatic finger
[102]. A reservoir computing approach with the ESN architecture was used to model the 2D motion of a bioinspired turtle actuated through soft pneumatic flippers
[98].
Learning-based techniques also proved to be successful in learning the dynamic models of soft robots. One approach involved using a nonlinear autoregressive exogenous (NARX) model to develop a dynamic model for a soft manipulator, which was used to implement a task space controller
[105]. A deep neural network (DNN) model learned the non-linear dynamics for a single degree of freedom inflatable pneumatic robot. The model was used to implement a model predictive control (MPC) algorithm for pressure control
[106].
3.4. Control
Models of soft robots developed using the discussed approaches are used to develop kinematic and dynamic control for these complex systems. Model-based controllers rely on models obtained from analytical kinematic methods such as PCC, whereas model-free controllers use data-driven techniques
[80]. Different control algorithms are used, depending on the level at which the controller operates. Low-level controllers drive the actuators, whereas mid-level ones are responsible for the kinematic and dynamic control, and high-level control involves advanced trajectory and path planning for tasks such as obstacle avoidance.
The main task of soft robots’ controllers is to manage the whole-body deformation of the robot; it may also include controlling the exact position and orientation of an end effector in the case of soft manipulators. One approach uses open-loop control for a soft manipulator
[107]. The implemented open-loop dynamic controller uses a data-driven model with only mechanical feedback. The swimming eel-like robot
[108] was modeled using the geometrically exact beam theory with a torque control algorithm. A different approach uses an energy-shaping approach to develop the control law for a soft continuum manipulator
[109]. MPC is another technique that was employed for the control of large-scale soft robots
[110]. The MPC algorithm relies on a PCC model alongside a kinematic representation for efficient state prediction. A model reference predictive adaptive control (MRPAC) was also implemented on the same model and showed robustness to model uncertainties. Closed-loop control methods have also been demonstrated for the position control of soft robots
[111][112].
One of the most promising approaches for model-free control in the robotics field, in general, is reinforcement learning (RL), which has been proven successful for soft robots
[113]. RL can be described as an iterative learning process where the agent takes an action, its new state is observed, and a response in the form of a reward function is given to it based on the resulting interaction with its environment. When the agent learns a policy to map appropriate state-action pairs, then the learning is successful. RL can be implemented regardless of whether a model of the system is known. The use of deep reinforcement learning (DRL) and imitation learning algorithms in the development of soft robots has been shown in several experimental examples. DRL methods are now integrated with soft robots in various applications such as biomedical and edible robotics. Some research focused on the deep Q-network (DQN) algorithm, which was used in a soft robotic fish used for underwater exploration
[29]. Other common algorithms are deep deterministic policy gradient (DDPG), normalized advantage function (NAF), and advantage actor-critic (A2C). However, problems arise from the differences between the simulation environments where the robots are trained and the real-world environment. The use of generative adversarial network (GAN) is a suggested solution to help perform domain adaptation and narrow the gap between the simulation and real-world environments. Imitation learning is also beneficial when it is difficult to formulate a reward function to train a DRL model. The imitation learning algorithms use demonstrations constructed by an expert agent that are transferred to the soft robot. The most common imitation learning approaches are behavior cloning, inverse reinforcement learning, and generative adversarial imitation learning. The future scope is the combination of both deep reinforcement learning and imitation learning algorithms to benefit from both approaches and train better agents. One example is a soft artificial-muscle-driven robot mimicking cuttlefish actuated by a dielectric elastomer (DE) membrane
[55]. The robot uses reinforcement learning to provide the actuation. To simplify the problem, only two actions are considered, with only two voltage amplitudes: 0 and 6.8 kV. The robot is trained through trial-and-error interaction with its environment in order to find an optimal policy to maximize its reward function, which is the displacement at each time step. The displacement is monitored using a camera and then fed to the reinforcement learning algorithm.
Applying reinforcement learning with soft robots is a costly operation due to their non-linear hyperelastic properties. To overcome this issue, the reinforcement learning approach used to control the Honeycomb Pneu-Nets Soft Robot ignores the specific properties of the materials and the structural characteristics of the robot
[114], concentrating only on the geometric model, which simplifies the modeling task. Another challenge to implementing RL is the lack of accurate simulators for soft robots. To overcome this problem, researchers performed the training process on both the simulation as well as the physical hardware to obtain a more realistic control policy that works well with the actual robot. The reinforcement learning framework consists of two parts: formulating the set of representations for the robot’s states and actions, and the training process to search for accurate results in the problem space. The physical hardware uses air pumps and valves to achieve the actuation needed and uses the OptiTrack motion capture system for visual sensing to determine the actual state of the robot. Two methods of control were used to execute the learned RL policy: open-loop and closed-loop. The closed-loop method obtains the actual robot state from the sensor as the input to the trained control policy function, while in the open-loop method, the robot’s state is obtained from the simulator, leading to some errors.
Despite providing a good solution in many cases, RL may be impractical to implement when the reward function cannot be clearly defined. Imitation learning solves this issue by using demonstrations performed by an expert agent. One group used imitation learning to perform motion control and trajectory planning for soft continuum robots
[115]. They proposed the learning from demonstration (LfD) approach and implemented it on the Bionic Handling Assistant (BHA) robot, which resembles an elephant trunk. One of the simplest methods to implement LfD is kinesthetic teaching, which is achieved by directly recording demonstrations on the target robot to collect the position and orientation data of the end effector. The problem of using kinesthetic teaching with soft robots arises from the dynamic complexity of non-linear elasticity associated with these soft materials. They proposed an active compliant control to record the demonstrations during kinesthetic teaching, then encoded the recorded data with a task-parametrized Gaussian mixture model (TP-GMM).
A similar approach was used with the soft cylindrical robot arm STIFF-FLOP
[116]. However, the researchers worked on transferring the movement patterns of an octopus arm to the STIFF-FLOP. The octopus arm movements were obtained from a database of the cartesian position of several octopuses’ arms with an average of 100 points along the arm. They exploited several methods for the representation of the octopus reaching motion to help transfer the arm movements to the soft robot. They considered spatiotemporal representation and dynamical movement primitives to allow for a more robust movement transfer. They then used Gaussian mixture regression (GMR) for encoding, and applied a self-refinement algorithm with a weighted reward function according to different tasks.
Following a similar thought process to imitation learning, a trending new paradigm called morphological computation aims to learn from living creatures, but on the level of morphology instead of the locomotion level. Morphological computation is a process related to embodied intelligence, where some of the computation needed for actuation and perception is conducted automatically by the body of the living creature instead of sending sensory information to the brain and waiting for a control signal
[117]. Morphological computation can be exploited in the context of soft robots to offload some of the control to the body. This process is possible owing to the adaptable compliance exhibited by soft materials, which act as a reservoir computer that can process inputs from its environment and take appropriate actions.
A morphological computation framework with a mathematical realization was proposed using a reservoir of recurrent non-linear mass-spring systems, which is a model for actual physical soft bodies
[118]. The model demonstrated the ability to learn the end-effector trajectory of a robotic arm. The group also showed that adding feedback to the morphological computation system allows it to perform autonomous periodic patterns such as the ones responsible for locomotion
[119].