At the 2024 Slush Conference, Emil Eifrem, Co-founder and CEO of Neo4j, shared how graph databases are revolutionizing data analytics. Neo4j, headquartered in Silicon Valley, powers critical use cases from the Panama Papers investigation into tax evasion to NASA’s mission to Mars and enterprise adoption of Generative AI. Known for its graph database and analytics technology to uncover relationships in data, Neo4j has become essential for complex data-driven challenges involved with modern applications like fraud detection, supply chain, and generative AI, with Gartner predicting widespread adoption by 2025. In this interview, Emil discusses Neo4j’s open-source origins, AI integration, and advice for enterprise CEOs and startup founders, offering valuable insights into the future of data-driven innovation.
What were some challenges in the early days of Neo4j that turned into opportunities for product development and go-to-market strategies?
One of the biggest opportunities and challenges in the early days was figuring out how to build a company around an open-source product. From the beginning, we had the Neo4j Community Edition, which was free and open source. Anyone could download it, experiment with it, and build applications—without even needing to register. This accessibility created a grassroots movement. For example, in 2019, there were 500 independent events related to Neo4j, like meetups and webinars, with most organized spontaneously by the community.
However, building a business on open source is not straightforward because you’re giving away a significant portion of your product for free. The solution was to identify features that enterprises valued—features like LDAP and Kerberos integration, which are critical for enterprise ecosystems but less relevant for independent developers or startups. This segmentation allowed us to distinguish between users with more time than money and those with more money than time. The former includes students and independent developers, for whom the product is free. The latter—large enterprises—are willing to pay for features that accelerate their core business development.
The key philosophy is to build a thriving ecosystem by giving the product for free to those with more time than money while monetizing features that enterprises need.
How did you balance community-driven growth with business development?
We were very thoughtful and intentional about this balance. Growing up in the open-source ecosystem, I had experience thinking about monetizing open-source software. It’s a two-stage process: first, achieving product-market fit for the free version by proving the core value of graph databases; second, achieving product-market fit for monetization by identifying features valuable to enterprises. This strategy allowed us to separate the user base into those we could monetize and those who would contribute to the community’s growth.
How do you see your user base today?
Our user base splits along two axes: startups versus enterprises and developers versus data scientists. For startups, we support adoption rather than monetization. We have a startup program and a free tier in our cloud offering, Aura, which provides an entry-level option for as little as $65 per month.
For enterprises—primarily the Global 2000—our focus is on monetization. These organizations value features that integrate with their complex ecosystems and infrastructure.
In terms of user demographics, roughly 50-60% are developers and application owners and 40-50% are data scientists.
For startup founders building social networks, how do graph databases compare to relational databases?
A graph model is inherently better suited for applications like social networks due to its ability to handle connected data efficiently. Unlike relational databases, which can struggle with complex queries and relationships, graph databases excel at modeling and querying relationships. This makes them a natural fit for applications such as social networks, recommendation engines, and fraud detection.
However, many startups begin with relational databases due to familiarity and existing expertise. Often, they transition to graph databases as their needs grow more complex, particularly when they hit the limitations of relational models in handling connected data.
For new founders, adopting a graph database model early could save significant re-engineering effort down the road, provided they are willing to invest in acquiring the necessary skills. Neo4j, for example, provides ample resources and community support to help teams learn and implement graph databases.
Why should startups choose graph databases over relational ones for applications like social networks?
There are two core arguments, with a bonus point:
1. Ease of Development:
Graph databases map naturally to domains involving connections and relationships. In a social network, nodes represent users, and relationships capture interactions like friendships or follows. While relational databases can handle such data, they require numerous joins between tables and complex translations, which add significant development time. For startups, where speed to market is critical, graph databases allow faster iteration and development.
2. Advanced Insights:
Graph databases offer powerful native algorithms, like PageRank for finding influential users or Louvain clustering for identifying communities, which are difficult or impossible to achieve within relational databases. These capabilities enable insights that directly enhance user engagement and application functionality.
3. Future-Proofing with AI (Bonus):
Modern graph tools integrate with AI technologies. For instance, Neo4j’s integration with large language models (LLMs) allows you to ask natural language questions like, “Who is the best match between a founder and an investor?” The system generates graph queries, making the technology accessible even for those without extensive graph expertise.
What’s the current landscape for integrating Neo4j with modern frameworks?
Neo4j, being open-source and widely adopted, integrates with most programming languages and frameworks. Thanks to the large developer community, mature integrations exist for popular stacks like Django, Ruby on Rails, and others. The maturity of specific integrations depends on the framework’s popularity—highly used frameworks tend to have better-developed connectors. Additionally, Neo4j supports all major cloud providers, including Google Cloud, AWS, and Azure.
As graph databases continue to evolve, standards are also emerging. Neo4j is actively involved in shaping the future of graph query languages, such as the ongoing work on the GQL International Standard for graph query languages.
Do you expect graph databases to overtake relational databases?
Relational databases will remain a cornerstone of data infrastructure, particularly for tabular, structured data like payroll systems or simple CRUD applications. However, modern domains involving connected data—such as e-commerce recommendations, social networks, and fraud detection—are better served by graph databases. Most new applications will likely adopt graph databases because they reflect the connected nature of today’s data and provide unique analytical capabilities.
What role do graph databases play in AI, particularly with Gen AI?
The killer application of generative AI in enterprises is giving large language models (LLMs) access to internal enterprise data. This has evolved through stages:
- Fine-Tuning (Early 2023):
Initially, fine-tuning was the solution, but it required specialized expertise, constant retraining as data changed, and lacked granular access controls. - RAG Architecture (Mid to Late 2023):
Retrieval-Augmented Generation (RAG) emerged as a better approach. RAG combines off-the-shelf LLMs with data retrieval from a database (like Neo4j). This allows the LLM to generate insights using up-to-date secure enterprise data without retraining.
Graph databases, like Neo4j, are critical in RAG (also referred to as GraphRAG) because knowledge graphs built on them excel at managing relationships and context-rich queries, which are essential for tasks like understanding how internal data points interconnect. They are also proven to make GenAI results accurate, transparent, and explainable to normal humans. Those benefits are huge, and why graph is an essential part of the data stack today.
How is Neo4j addressing AI challenges?
Neo4j integrates deeply with AI workflows. For example, users can input natural language queries about their business, and the system uses LLMs to generate complex Cypher queries. This lowers the barrier to adoption for non-technical users and aligns graph databases with the AI-driven future of enterprise applications.
Takeaways from the Conversation
This interview highlighted several key insights:
- Open Source as a Business Model:
Emil Eifrem provided a compelling perspective on how Neo4j leverages open source to foster community engagement while strategically monetizing enterprise-specific features. - Graph Databases and AI Integration:
Neo4j’s graph model aligns naturally with the interconnected structure of real-world data, making it a superior choice for applications using social networks and AI use cases. The integration of graph databases with AI technologies, particularly Retrieval-Augmented Generation (RAG) with GraphRAG, showcases how Neo4j enables enterprises to extract insights and deliver explainable, secure results. - Klarna Case Study:
Klarna’s AI chatbot, powered by Neo4j, serves as a prime example of real-world AI ROI. The “Kiki” chatbot, integrated with Klarna’s knowledge graph, is transforming the way the company collaborates and improves productivity. As Sebastian Siemiatkowski, Co-Founder and CEO of Klarna, explains:
“At Klarna, we’re transforming the way we collaborate with our GenAI chatbot Kiki, powered by Neo4j’s knowledge graph. Kiki brings together information across multiple disparate and siloed systems, improves the quality of that information, and explores it, enabling our teams to ask Kiki anything from resource needs to internal processes to how teams should work. It’s having a huge impact on productivity in ways that were not possible to imagine before without graph and Neo4j.”
This case study demonstrates the benefits of graph technology in driving business impact and highlights how Neo4j is scaling as a company. In 2024, Neo4j achieved a significant revenue milestone, reflecting the growing demand for its graph database solutions across industries.
- Cultural and Regional Insights:
Emil emphasized Silicon Valley’s continuing dominance as an innovation hub, particularly in the AI space, while acknowledging emerging ecosystems in cities like Paris and tech-forward regions in Asia. His perspective on cultural work ethics and regulatory differences between Europe and the U.S. offered a nuanced view of the challenges and opportunities for entrepreneurs in different regions. - Practical Advice for Founders:
Emil advised early-stage founders to immerse themselves in Silicon Valley for its ecosystem advantages while scaling engineering teams beyond the Valley to attract and retain talent. His insights reflect a balanced approach to leveraging the best of both worlds.