Logo-Graphenus-negative

Features and tools

FUNCTIONAL ARCHITECTURE

End-to-end data management and control: from infrastructure management and monitoring to storage, 

data analysis and governance

  • High-availability distributed storage

 

  • Distributed batch and streaming data processing

 

  • Data representation and SQL queries

 

  • Building notebooks to perform computing, data science or machine learning tasks

 

  • Access to different data sources, including in real time

 

  • User interface for interacting with the distribution tools
  • Process planning

 

  • Monitoring the use and operation of services

 

  • Security layer for the protection of data access based on sensitivity, authentication and authorisation management.

 

  • Data governance throughout the information processing cycle

FUNCTIONAL ARCHITECTURE

security-governance

Roadmap

and updates

Two evolutionary lines, several annual releases

Graphenus' release policy reduces the risk of technological obsolescence by continuously incorporating new capabilities. 

precipitated_vessel

Experimental

  • New developments

 

  • New versions of the components

 

  • Proofs of concept

 

  • Own team looking for possible functionalities or new tools

 

  • Release period: 3 months
flag

Stable

  • Fully stable software

 

  • Promoted from the experimental version

 

  • Seamlessly integrated with the rest of the components

 

  • Release period: 6 months.

Promotion: Every 3 months we analyse which components and developments have sufficient stability to move from the experimental version to the stable version, generating a new release.

PRODUCT ROADMAP

FUNCIONALITY

circle-g

Data Governance:

- Linkedin Datahub update.

 

Security:
- Ranger Audit (elastic).

 

Interoperability:
- GAIA X Inception
- Nifi

 

Storage:
- Ozone

 

Administration:
- Graphenus Manager:

Service management and centralised access

circle-g

Security:

- Ranger Policy Share.

Administration:

- Graphenus Manager: Centralisation of logs and metrics

Machine Learning:

- Inclusion of new machine learning libraries

Interoperability:

- GAIA X Ready

- PowerBI & Qlik Integration

SW Base & Infra:

- Adaptation to Rocky 8

- Kubernetes Inception

circle-g

Administration:

- Graphenus Manager:

Configurations and versioning

Sandbox:

- Sandbox availability public

SW Base & Infra:

- Adaptation to Rocky 8

- Kubernetes Ready

circle-g

Security:
- Automatic management of principals, keytabs and certificates.

Interoperability:
- Flink
- Apache Iceberg
- Cassandra

EXPERIMENTAL

2022-3T

2022-4T

2023-1T

2023-2T

STABLE

2022-4T

2023-1T

2023-2T

2023-3T

Logo-Graphenus-negative
20 open source tools,
a comprehensive solution

Access to different data sources, including in real time.

TRINO | SPARK | KAFKA

High-availability distributed storage.

HDFS

Container management.

DOCKER SWARM

Distributed batch and streaming data processing.

YARN | SPARK

Process planner

AIRFLOW

Data representation and SQL queries.

HIVE | TRINO

Build notebooks for computing, data science or machine learning.

JUPYTER NOTEBOOKS

User interface for interacting with the tools of the distro.

HUE

Monitoring of the use and operation of all services.

CADVISOR , LOKI , PROMETHEUS , GRAFANA

Identification, authentication and authorisation of users.

FREEIPA | KERBEROS | KNOX | KEYCLOAK | RANGER | RANGER

Data governance throughout the information processing cycle

Linkedin DataHub