
πŸ”Log Lens

Welcome to Log Lens, a log management system designed to handle large volumes of log data efficiently. The project comprises a robust log ingestor and a user-friendly query interface, providing seamless log analysis. This guide explains the project, how to run it, and its features.

Contents

  1. About Me
  2. Features Implemented
  3. Getting Started
  4. Demo Video
  5. Solution Architecture
  6. Technologies Used
  7. Why this Architecture
  8. Benchmarking
  9. How it Can be Improved Further
  10. References Used

βš™οΈ Features Implemented

  • Log Ingestor
  • Ability to send HTTP responses
  • Added Kafka, Logstash, Elasticsearch, Postgres, and Kibana
  • Highly scalable and fault tolerant
  • Query Interface

πŸ›οΈ Proposed Architecture

Components Overview

  • Log Ingestion Layer

    • Nginx: Serves as the web server for handling HTTP log ingestion. Ensures secure and efficient transfer of logs into the system.
  • Message Queue

    • Kafka: Implemented as a distributed event streaming platform to enable real-time processing and parallel ingestion of logs. Enhances fault tolerance and ensures data durability.
  • Log Processing and Storage

    • Logstash: Ingests and processes logs from various sources, providing flexibility in log handling and transformation.

    • Postgres: Utilized as a relational database for structured storage of log data. Ensures data integrity through ACID compliance and supports complex queries.

    • Elasticsearch: Stores log data efficiently and facilitates powerful search and analysis capabilities. Integrated with Kibana for real-time visualization.
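To make the ingestion path above concrete, here is a minimal sketch of an Express endpoint that publishes incoming log records to Kafka. It assumes the kafkajs client; the topic name (`logs`), broker address, and validation rules are illustrative placeholders rather than the repository's exact configuration.

```js
// Minimal sketch of the ingestion path: an Express endpoint that validates a
// log record and publishes it to a Kafka topic. Topic name, broker address,
// and validation rules are illustrative placeholders.
const express = require('express');
const { Kafka } = require('kafkajs');

const kafka = new Kafka({ clientId: 'log-ingestor', brokers: ['kafka:9092'] });
const producer = kafka.producer();

const app = express();
app.use(express.json());

app.post('/logs', async (req, res) => {
  const log = req.body;
  if (!log || !log.level || !log.message) {
    return res.status(400).json({ error: 'level and message are required' });
  }
  // Publish to Kafka; replication keeps the record durable even if a broker
  // fails before downstream consumers process it.
  await producer.send({
    topic: 'logs',
    messages: [{ value: JSON.stringify(log) }],
  });
  res.status(202).json({ status: 'accepted' });
});

producer.connect().then(() => app.listen(3000));
```

From the topic, Logstash (or a dedicated consumer) can read the records and forward them to Elasticsearch and Postgres, as described above.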

🌟 Proposed Features

  1. Fault Tolerance:

    • Distributed components and Kafka's replication ensure the system can continue functioning in the face of failures.
    • Database replication in Postgres enhances data availability and fault tolerance.
  2. Scalability:

    • Elasticsearch and Kafka support horizontal scaling, allowing the system to handle increasing log volumes seamlessly.
    • Auto-scaling mechanisms can be implemented for ELK components to dynamically adapt to varying workloads.
  3. Real-time Processing:

    • Kafka's event streaming platform enables parallel ingestion, reducing latency and providing real-time log processing.
    • Integration with Kibana allows users to visualize and analyze log data as it is ingested.
  4. Security:

    • Implementation of SSL/TLS encryption for communication between components.
    • Robust access control mechanisms for Elasticsearch, Kafka, and Postgres to ensure data privacy.
  5. Monitoring and Logging:

    • Centralized logging setup for comprehensive monitoring, debugging, and performance analysis.
    • Introduction of an alerting system for real-time notifications about critical events.
  6. Documentation and Knowledge Transfer:

    • Enhancement of system documentation to provide detailed insights into architecture, deployment procedures, and troubleshooting steps.
    • Conducting training sessions for the operations and development teams to facilitate a deeper understanding of the technologies used.
  7. Containerization and Orchestration:

    • Consideration of containerization using Docker for improved portability.
    • Exploration of orchestration tools such as Kubernetes for automated deployment, scaling, and management of containerized components.
  • The proposed architecture aims to build upon the existing system's strengths, addressing areas for improvement and introducing features that enhance fault tolerance, scalability, and real-time processing. This comprehensive approach ensures the Log Ingestor and Querying System remains resilient, adaptable, and efficient in handling diverse log data.

  • The above architecture is designed to handle more than one million users while remaining both fault tolerant and resilient.

🚀 Getting Started

Prerequisites

  • Node
  • Docker
  • Docker Compose

Installation

Follow the steps below to set up and run the project locally.

  1. Clone the repo
    git clone https://github.com/dyte-submissions/november-2023-hiring-deveshXm.git

  2. Start the Docker Compose stack
    docker compose up


(back to top)

🔄 Usage

  1. Produce logs at http://localhost:3000/logs

      curl -X POST \
      http://localhost:3000/logs \
      -H 'Content-Type: application/json' \
      -d '{
            "level": "info",
            "message": "Application started successfully",
            "resourceId": "server-5678",
            "timestamp": "2023-11-19T12:34:56Z",
            "traceId": "def-uvw-456",
            "spanId": "span-789",
            "commit": "a1b2c3d",
            "metadata": {
                "parentResourceId": "server-1234"
            }
          }'
    
    
  2. Query Elasticsearch at http://localhost:3001 (see the sketch after this list for how these parameters might map to an Elasticsearch query)

      GET http://localhost:3001/search?{Query Params}
    
    
    • Query Params

      • level
      • message
      • resourceId
      • timestampStart
      • timestampEnd
      • traceId
      • spanId
      • commit
      • parentResourceId
    • Example Request

        GET http://localhost:3001/query?resourceId=server-1&timestampStart=2023-08-15T00:00:00Z&timestampEnd=2023-09-15T23:59:59Z
      
  3. Real-time log stream in Kibana

      http://localhost:5601/app/logs/stream?flyoutOptions=(flyoutId:!n,flyoutVisibility:hidden,surroundingLogsId:!n)&logFilter=(expression:%27%27,kind:kuery)&logPosition=(end:now,position:(tiebreaker:6,time:1700383920183),start:now-1d,streamLive:!f)
    
  4. Access Dashboard UI at http://localhost:4173
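For reference, the sketch below shows one way the query interface could translate the parameters listed above into an Elasticsearch query. It assumes the official @elastic/elasticsearch client (v8 API) and an index named `logs`; field names follow the log schema shown in the curl example, and the actual service may be wired differently.

```js
// Sketch of a query endpoint that builds an Elasticsearch bool query from the
// request parameters. Index name and client configuration are assumptions.
const express = require('express');
const { Client } = require('@elastic/elasticsearch');

const es = new Client({ node: 'http://elasticsearch:9200' });
const app = express();

app.get('/search', async (req, res) => {
  const { timestampStart, timestampEnd, ...fields } = req.query;
  const must = [];

  // Match clauses for fields such as level, resourceId, traceId, spanId, commit.
  for (const [field, value] of Object.entries(fields)) {
    must.push({ match: { [field]: value } });
  }

  // Optional time-range filter built from timestampStart / timestampEnd.
  if (timestampStart || timestampEnd) {
    must.push({
      range: {
        timestamp: {
          ...(timestampStart && { gte: timestampStart }),
          ...(timestampEnd && { lte: timestampEnd }),
        },
      },
    });
  }

  const result = await es.search({ index: 'logs', query: { bool: { must } } });
  res.json(result.hits.hits.map((hit) => hit._source));
});

app.listen(3001);
```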


(back to top)

πŸ› οΈ Technologies Used

Frontend

| Technology Used | Reason |
| --- | --- |
| ReactJS | UI Development |
| Vite | Fast Development |
| Tailwind CSS | Styling Efficiency |
| Axios | HTTP Requests |

Backend

| Technology Used | Reason |
| --- | --- |
| Node.js | Server-Side JavaScript and Backend Development |
| Express | Minimalist Web Application Framework for Node.js |
| PostgreSQL | Robust Relational Database Management System |
| Elasticsearch | Distributed Search and Analytics Engine |
| Kibana | Real-Time Log Analysis |
| Logstash | Data Processing and Ingestion Tool for Elasticsearch |
| Pino | Fast and Low-Overhead Node.js Logger |
| Apache Kafka | Distributed Streaming Platform for Real-Time Data |
| Docker | Containerization |
| NGINX | Load Balancer and Web Server |

πŸ† Why this Architecure

  • Fault Tolerance

    Distributed Components: The use of ELK stack, Postgres, Nginx, and Kafka involves distributed components, contributing to fault tolerance. In the event of a failure in one component, the system can continue to function without a complete breakdown.

    Kafka's Replication: Kafka's replication capabilities ensure that log data is not lost even if one or more Kafka brokers experience failures. This redundancy enhances the overall fault tolerance of the system.

    Data Integrity with Postgres: Postgres, being ACID compliant, ensures data integrity even in the face of unexpected failures, preventing data corruption and loss.

  • Scalability

    Elasticsearch Scaling: ELK stack's Elasticsearch is designed to scale horizontally, allowing the system to efficiently handle increased log volumes. As the data load grows, additional Elasticsearch nodes can be added to distribute the workload.

    Kafka's Horizontal Scaling: Kafka's distributed architecture supports horizontal scaling, enabling the system to handle growing traffic by adding more Kafka brokers. This ensures the scalability required for a log ingestion system.

  • Real-time Processing

    Kafka's Event Streaming: Kafka's event streaming platform facilitates real-time processing of logs. It enables parallel ingestion of logs, reducing latency and ensuring that log data is available for analysis almost immediately.

    Kibana Visualization: The integration with Kibana in the ELK stack provides real-time visualization capabilities, allowing users to monitor and analyze log data as it is ingested.

  • Data Integrity and Structured Storage

    Postgres Relational Model: Postgres is employed for its relational model, allowing for structured and organized storage of log data. This facilitates complex queries, joins, and ensures data integrity through ACID compliance.

    Elasticsearch Search and Analysis: Elasticsearch's powerful search and analysis capabilities contribute to efficient data retrieval, supporting a wide range of queries for effective log analysis.

  • Conclusion

    The chosen architecture combines fault tolerance, scalability, and real-time processing to create a robust Log Ingestor and Querying System. By leveraging the strengths of ELK stack, Postgres, Nginx, and Kafka, the solution ensures data reliability, adaptability to changing workloads, and timely availability of log data for analysis.
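As an illustration of the parallel-ingestion point, the sketch below shows a Kafka consumer group using kafkajs: running several instances of this process under the same group id spreads the topic's partitions across them, and partitions are rebalanced automatically if an instance fails. The topic and group names are illustrative, not the repository's actual configuration.

```js
// Sketch of parallel consumption with a Kafka consumer group. Multiple
// instances sharing the "log-processors" group divide the partitions of the
// "logs" topic between them; if one instance dies, its partitions are
// reassigned to the survivors. Names are illustrative placeholders.
const { Kafka } = require('kafkajs');

const kafka = new Kafka({ clientId: 'log-processor', brokers: ['kafka:9092'] });
const consumer = kafka.consumer({ groupId: 'log-processors' });

async function run() {
  await consumer.connect();
  await consumer.subscribe({ topic: 'logs', fromBeginning: false });

  await consumer.run({
    eachMessage: async ({ partition, message }) => {
      const log = JSON.parse(message.value.toString());
      // Hand the record to downstream storage (e.g. Elasticsearch / Postgres).
      console.log(`partition ${partition}:`, log.level, log.message);
    },
  });
}

run().catch(console.error);
```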

📊 Benchmarking

To assess the performance and scalability of the Node.js services, the following load-testing tool can be used:

  1. Grafana k6
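Below is a minimal k6 script that exercises the ingestion endpoint from the Usage section. The virtual-user count and duration are arbitrary starting points, and the payload mirrors the example log record shown earlier.

```js
// Minimal k6 load test for the log ingestion endpoint. The URL matches the
// Usage section; the VU count and duration are arbitrary choices.
import http from 'k6/http';
import { check } from 'k6';

export const options = {
  vus: 50,          // concurrent virtual users
  duration: '30s',  // total test duration
};

export default function () {
  const payload = JSON.stringify({
    level: 'info',
    message: 'Benchmark log entry',
    resourceId: 'server-5678',
    timestamp: new Date().toISOString(),
    traceId: 'def-uvw-456',
    spanId: 'span-789',
    commit: 'a1b2c3d',
    metadata: { parentResourceId: 'server-1234' },
  });

  const res = http.post('http://localhost:3000/logs', payload, {
    headers: { 'Content-Type': 'application/json' },
  });

  check(res, { 'status is 2xx': (r) => r.status >= 200 && r.status < 300 });
}
```

Run it with `k6 run benchmark.js` while the Docker Compose stack is up.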

🔄 How it Can be Improved Further

  • Auto-scaling for the ELK components so the system adapts dynamically to varying workloads.

  • SSL/TLS encryption between components and stricter access control for Elasticsearch, Kafka, and Postgres.

  • Centralized monitoring with an alerting system for real-time notification of critical events.

  • Richer documentation and knowledge transfer for the operations and development teams.

  • Orchestration of the containerized services with Kubernetes for automated deployment, scaling, and management.

📖 References Used
