Abstract
IoT domains such as health-care, automation and control, augmented and virtual reality etc are incubating novel applications and devices everyday. The data generated in these domains must be processed under strict delay requirements. Beyond the processing deadlines, the information loses its value. Apart
from that, processing resources are required to be available all the time and Cloud’s or communication failure must not hinder the services provided by such IoT applications. So, reliability is one of the major concerns for all these IoT applications. Considering all these requirements, it is been observed that instead of processing such data at the far Cloud, it is beneficial to process it closer to the source. This led to a new paradigm called Edge Computing. Edge computing has emerged as an effective solution for delay sensitive IoT applications. In the Edge-Cloud hierarchy, reliability and fault tolerance are the major issues. This thesis proposes a novel fault-tolerant and reliable hierarchical IoT-Cloud architecture
which can survive the failures of the Cloud Server and the Edge Server(s). In the proposed architecture, the sensed data processing is distributed over four levels (Cloud-Fog- Mist-Dew) based on the level processing power and distance from the end IoT devices. This hierarchical architecture is an advancement over the existing IoT-Cloud architecture. The proposed system becomes reliable by replicating the model-files (generated after data training) and some relevant data at the Edge Server from the Cloud. It allows the Edge to generate feedback after receiving the sensed data, in case
of any event happens. It indeed improves the reliability of IoT applications by providing them services in the case of unavailability of the processing resources especially due to the communication failure with the Cloud Server. Although, hierarchical Edge-Cloud architecture resolves the problems of resource unavailability and
feedback delays, what if the Edge Server fails? It induces unheard issues of reliability and fault tolerance at the Edge. The system can not be completely reliable until the reliability issues at the edge of the network are resolved. This thesis proposes a novel mechanism using connection switching to deal with Edge Servers’ failures. In case of an Edge failure, IoT application is redirected to an alternate
available Edge Server. If there are multiple alternate Edge Servers available, redirection to a new Edge is decided based on the delay-tolerance of the IoT applications. It makes the entire system reliable and fault-tolerant.
The proposed IoT-Cloud architecture with Reliable-Edge has been implemented as an extension of Landslide Early Warning System (LEWS) to improve its reliability. LEWS is one of the most suitable
applications to apply the proposed mechanism(s). During laboratory experiments, results demonstrate that the system attains maximum availability of the computation resources. Along with this, results
prove its efficiency when the proposed system is analyzed theoretically and simulated using Matlab.
The thesis also contributes to the construction of a Reliable Edge Controller (EC) that reliably assigns any type of computational resources available at the edge to the IoT applications. A variety of devices available at the network access layer have been considered to be utilized to serve IoT applications. This
includes the use of devices available with private users, dedicated Edge Servers and Cloud infrastructure. The proposed system learns the optimal operating parameters during initial runs. Using the knowledge acquired in the learning phase, an integer linear programming problem is formulated to minimize the
Mean Time To Complete (MTTC) the request for all the IoT nodes. The solution of the formulated problem provides optimized resource allocation for all the IoT nodes. Later, considering the unreliable
nature of the privately owned devices, the learning and formulation has been extended to incorporate probability of failure of these devices. This evolves the EC into a reliable one.
Keywords: Internet of Things (IoT), Edge Computing, Real-time Applications, Reliability, Fault Tolerance, Cloud Computing, SDN