DEV Community

Madhuja Mitra
Madhuja Mitra Subscriber

Posted on

Understanding Abstraction and Remote Procedure Calls in Distributed Systems

Distributed systems are inherently complex due to the need for multiple components to communicate over a network. Two key concepts that help manage this complexity are abstraction and Remote Procedure Calls (RPCs). This article explores both abstraction and RPC in distributed systems, highlighting their significance and usage.

Abstraction in Distributed Systems

Abstraction in software development involves hiding complex details to present a simpler interface for users and developers. In distributed systems, abstraction is crucial for several reasons:

Encapsulating Complexity: Distributed systems involve numerous components that must interact over networks, manage data consistency, and handle failures. Abstraction simplifies these complexities by providing models like APIs or service interfaces, allowing developers to focus on higher-level functionalities without delving into the intricate details of the underlying processes.
Separation of Concerns: Abstraction enables different system parts, such as computation, networking, and storage, to be managed independently. This separation allows developers to concentrate on specific tasks without needing to understand how other system parts operate.
Middleware as an Abstraction Layer: Middleware provides common services to applications, hiding the complexities of communication, data management, and message passing. This allows developers to build applications without dealing with low-level issues.
Handling Faults and Consistency: Abstraction helps maintain reliability and data consistency in distributed systems by offering built-in mechanisms like replication and consensus algorithms, which developers can leverage without having to implement them from scratch.
Resource Management: Cloud platforms use abstraction to simplify resource management, such as virtual machines or containers, allowing developers to use resources without worrying about hardware specifics.

Remote Procedure Call (RPC)

RPC is a protocol that enables a program to execute a procedure on a different machine over a network, mimicking a local procedure call. It simplifies distributed computing by abstracting network communication details and providing a straightforward interface for developers.

Transparency and Simplicity: RPC aims to make remote calls appear like local ones, hiding complexities such as network communication and data serialization, thereby easing the development of distributed systems.
Client-Server Model: RPC operates on a client-server model where the client requests a procedure execution on the server, which processes the request and returns the result to the client.
Stubs and Skeletons: RPC uses client and server stubs. The client stub packages the procedure's parameters and sends them over the network, while the server stub unpacks the message and executes the procedure.
Marshalling and Unmarshalling: These processes involve converting data to a network-friendly format (marshalling) and back to a usable format on the receiving end (unmarshalling).
Error Handling: RPC must manage errors like network failures and server crashes, which adds complexity compared to local calls.
Popular RPC Frameworks: Frameworks like gRPC, JSON-RPC, and Apache Thrift help implement RPC, each offering unique features and supporting various use cases.

Conclusion
Abstraction and RPC are essential for simplifying the development and management of distributed systems. Abstraction hides the complex details of system architecture, while RPC simplifies communication between system components. Together, they enable developers to build robust, scalable, and efficient distributed applications without getting overwhelmed by the intricate details of network communication and system management.

Reference :
https://www.educative.io/courses/grokking-modern-system-design-interview-for-engineers-managers/network-abstractions-remote-procedure-calls

Top comments (0)