A challenge when setting up a data lake is avoiding vendor lock-in, i.e. dependence on a single service provider or manufacturer. The initial costs of setting up the system are manageable. Over time, however, more and more requirements arise and are solved by individual extensions. The longer the system is in operation, the higher its value, both in terms of the data it contains and the hours invested in connecting data sources and analyzing data.
This is fine as long as you are satisfied with your service provider and the manufacturer of the solution. Should this no longer be the case, you need the flexibility to move your existing system to another provider in order to protect your already extensive investment. This independence is only guaranteed if you consistently rely on open source systems such as the components of the SMACK stack (Spark, Mesos, Akka, Cassandra and Kafka) or the free variants of Elastic software (e.g. Elasticsearch, Logstash, Kibana), and if you work with service providers who implement them, like ESONO AG.
Contact us for more information on building a data lake. We offer different implementation concepts, from the mid-range to the enterprise segment.