Best DataGalaxy Alternatives in 2026
Find the top alternatives to DataGalaxy currently available. Compare ratings, reviews, pricing, and features of DataGalaxy alternatives in 2026. Slashdot lists the best DataGalaxy alternatives on the market that offer competing products that are similar to DataGalaxy. Sort through DataGalaxy alternatives below to make the best choice for your needs
-
1
DataHub
DataHub
10 RatingsDataHub is a versatile open-source metadata platform crafted to enhance data discovery, observability, and governance within various data environments. It empowers organizations to easily find reliable data, providing customized experiences for users while avoiding disruptions through precise lineage tracking at both the cross-platform and column levels. By offering a holistic view of business, operational, and technical contexts, DataHub instills trust in your data repository. The platform features automated data quality assessments along with AI-driven anomaly detection, alerting teams to emerging issues and consolidating incident management. With comprehensive lineage information, documentation, and ownership details, DataHub streamlines the resolution of problems. Furthermore, it automates governance processes by classifying evolving assets, significantly reducing manual effort with GenAI documentation, AI-based classification, and intelligent propagation mechanisms. Additionally, DataHub's flexible architecture accommodates more than 70 native integrations, making it a robust choice for organizations seeking to optimize their data ecosystems. This makes it an invaluable tool for any organization looking to enhance their data management capabilities. -
2
dbt
dbt Labs
239 Ratingsdbt Labs is redefining how data teams work with SQL. Instead of waiting on complex ETL processes, dbt lets data analysts and data engineers build production-ready transformations directly in the warehouse, using code, version control, and CI/CD. This community-driven approach puts power back in the hands of practitioners while maintaining governance and scalability for enterprise use. With a rapidly growing open-source community and an enterprise-grade cloud platform, dbt is at the heart of the modern data stack. It’s the go-to solution for teams who want faster analytics, higher quality data, and the confidence that comes from transparent, testable transformations. -
3
AnalyticsCreator
AnalyticsCreator
46 RatingsAccelerate your data journey with AnalyticsCreator—a metadata-driven data warehouse automation solution purpose-built for the Microsoft data ecosystem. AnalyticsCreator simplifies the design, development, and deployment of modern data architectures, including dimensional models, data marts, data vaults, or blended modeling approaches tailored to your business needs. Seamlessly integrate with Microsoft SQL Server, Azure Synapse Analytics, Microsoft Fabric (including OneLake and SQL Endpoint Lakehouse environments), and Power BI. AnalyticsCreator automates ELT pipeline creation, data modeling, historization, and semantic layer generation—helping reduce tool sprawl and minimizing manual SQL coding. Designed to support CI/CD pipelines, AnalyticsCreator connects easily with Azure DevOps and GitHub for version-controlled deployments across development, test, and production environments. This ensures faster, error-free releases while maintaining governance and control across your entire data engineering workflow. Key features include automated documentation, end-to-end data lineage tracking, and adaptive schema evolution—enabling teams to manage change, reduce risk, and maintain auditability at scale. AnalyticsCreator empowers agile data engineering by enabling rapid prototyping and production-grade deployments for Microsoft-centric data initiatives. By eliminating repetitive manual tasks and deployment risks, AnalyticsCreator allows your team to focus on delivering actionable business insights—accelerating time-to-value for your data products and analytics initiatives. -
4
MANTA
Manta
Manta is a unified data lineage platform that serves as the central hub of all enterprise data flows. Manta can construct lineage from report definitions, custom SQL code, and ETL workflows. Lineage is analyzed based on actual code, and both direct and indirect flows can be visualized on the map. Data paths between files, report fields, database tables, and individual columns are displayed to users in an intuitive user interface, enabling teams to understand data flows in context. -
5
OneTrust Privacy Automation
OneTrust
1 RatingTransparency, choice and control are key to trust. Organizations have the opportunity to leverage these moments to build trust, and provide more valuable experiences. People expect greater control over their data. We offer privacy and data governance automation to help organizations better understand and comply with regulatory requirements. We also operationalize risk mitigation to ensure transparency and choice for individuals. Your organization will be able to achieve data privacy compliance quicker and build trust. Our platform helps to break down silos between processes, workflows, teams, and people to operationalize regulatory compliance. It also allows for trusted data use. Building proactive privacy programs that are rooted in global best practice and not just reacting to individual regulations is possible. To drive mitigation and risk-based decision-making, gain visibility into unknown risks. Respect individual choice and integrate privacy and security by default in the data lifecycle. -
6
Mindfuel
Mindfuel
Mindfuel enables business and data teams to identify and prioritize the most significant use cases by assessing them in relation to company objectives and essential metrics. Facilitate a direct link between data and analytics products and business opportunities to improve decision-making processes and optimize resource distribution, guaranteeing that every product, from its initial idea to its implementation, showcases quantifiable value and influence on the business. Foster an environment of openness and strategic insight. Compile all available resources to enhance the discovery process and encourage the reuse of data products, ensuring consistent value creation and visibility throughout the entire journey from initial conception to final adoption, while also promoting collaboration among teams to drive innovation and efficiency. -
7
The Alation Agentic Data Intelligence Platform is designed to transform how enterprises manage, govern, and use data for AI and analytics. It combines search, cataloging, governance, lineage, and analytics into one unified solution, turning metadata into actionable insights. AI-powered agents automate critical tasks like documentation, data quality monitoring, and product creation, freeing teams from repetitive manual work. Its Active Metadata Graph and workflow automation capabilities ensure that data remains accurate, consistent, and trustworthy across systems. With 120+ pre-built connectors, including integrations with AWS, Snowflake, Salesforce, and Databricks, Alation integrates seamlessly into enterprise ecosystems. The platform enables organizations to govern AI responsibly, ensuring compliance, transparency, and ethical use of data. Enterprises benefit from improved self-service analytics, faster data-driven decisions, and a stronger data culture. With industry leaders like Salesforce and 40% of the Fortune 100 relying on it, Alation is proven to help businesses unlock the value of their data.
-
8
Databricks Data Intelligence Platform
Databricks
The Databricks Data Intelligence Platform empowers every member of your organization to leverage data and artificial intelligence effectively. Constructed on a lakehouse architecture, it establishes a cohesive and transparent foundation for all aspects of data management and governance, enhanced by a Data Intelligence Engine that recognizes the distinct characteristics of your data. Companies that excel across various sectors will be those that harness the power of data and AI. Covering everything from ETL processes to data warehousing and generative AI, Databricks facilitates the streamlining and acceleration of your data and AI objectives. By merging generative AI with the integrative advantages of a lakehouse, Databricks fuels a Data Intelligence Engine that comprehends the specific semantics of your data. This functionality enables the platform to optimize performance automatically and manage infrastructure in a manner tailored to your organization's needs. Additionally, the Data Intelligence Engine is designed to grasp the unique language of your enterprise, making the search and exploration of new data as straightforward as posing a question to a colleague, thus fostering collaboration and efficiency. Ultimately, this innovative approach transforms the way organizations interact with their data, driving better decision-making and insights. -
9
Atlan
Atlan
The contemporary data workspace transforms the accessibility of your data assets, making everything from data tables to BI reports easily discoverable. With our robust search algorithms and user-friendly browsing experience, locating the right asset becomes effortless. Atlan simplifies the identification of poor-quality data through the automatic generation of data quality profiles. This includes features like variable type detection, frequency distribution analysis, missing value identification, and outlier detection, ensuring you have comprehensive support. By alleviating the challenges associated with governing and managing your data ecosystem, Atlan streamlines the entire process. Additionally, Atlan’s intelligent bots analyze SQL query history to automatically construct data lineage and identify PII data, enabling you to establish dynamic access policies and implement top-notch governance. Even those without technical expertise can easily perform queries across various data lakes, warehouses, and databases using our intuitive query builder that resembles Excel. Furthermore, seamless integrations with platforms such as Tableau and Jupyter enhance collaborative efforts around data, fostering a more connected analytical environment. Thus, Atlan not only simplifies data management but also empowers users to leverage data effectively in their decision-making processes. -
10
Decube
Decube
Decube is a comprehensive data management platform designed to help organizations manage their data observability, data catalog, and data governance needs. Our platform is designed to provide accurate, reliable, and timely data, enabling organizations to make better-informed decisions. Our data observability tools provide end-to-end visibility into data, making it easier for organizations to track data origin and flow across different systems and departments. With our real-time monitoring capabilities, organizations can detect data incidents quickly and reduce their impact on business operations. The data catalog component of our platform provides a centralized repository for all data assets, making it easier for organizations to manage and govern data usage and access. With our data classification tools, organizations can identify and manage sensitive data more effectively, ensuring compliance with data privacy regulations and policies. The data governance component of our platform provides robust access controls, enabling organizations to manage data access and usage effectively. Our tools also allow organizations to generate audit reports, track user activity, and demonstrate compliance with regulatory requirements. -
11
Dataplex Universal Catalog
Google
$0.060 per hourDataplex Universal Catalog provides enterprise-wide visibility and governance for structured, semi-structured, and unstructured data. Its AI-powered semantic search allows users to query data in natural language, eliminating the need for complex search syntax. The platform enriches metadata with business context through glossaries, ownership attributes, and key usage details, supporting informed decision-making. It offers automated metadata ingestion, classification, and enrichment, reducing manual data management tasks. With built-in lineage tracking, organizations can trace data origins, transformations, and dependencies across multiple sources. BigQuery integration brings these governance capabilities directly into the analytics workflow, enhancing productivity. By connecting with BigLake, Dataplex extends governance to open lakehouses with Apache Iceberg and other engines. The result is a secure, scalable foundation for managing data-to-AI lifecycles across cloud-native and open-source ecosystems. -
12
Aggua
Aggua
Aggua serves as an augmented AI platform for data fabric that empowers both data and business teams to access their information, fostering trust while providing actionable data insights, ultimately leading to more comprehensive, data-driven decision-making. Rather than being left in the dark about the intricacies of your organization's data stack, you can quickly gain clarity with just a few clicks. This platform offers insights into data costs, lineage, and documentation without disrupting your data engineer’s busy schedule. Instead of investing excessive time on identifying how a change in data type might impact your data pipelines, tables, and overall infrastructure, automated lineage allows data architects and engineers to focus on implementing changes rather than sifting through logs and DAGs. As a result, teams can work more efficiently and effectively, leading to faster project completions and improved operational outcomes. -
13
Data360 Govern
Precisely
Your organization recognizes the significance of data and the importance of making it accessible to business users for optimal effectiveness; however, without proper enterprise data governance, locating, comprehending, and trusting that data may pose challenges. Data360 Govern serves as a comprehensive solution for enterprise data governance, cataloging, and metadata management, enabling you to have confidence in your data's quality, value, and reliability. By automating governance and stewardship responsibilities, it equips you to address vital questions regarding your data's origin, usage, significance, ownership, and overall quality. Utilizing Data360 Govern allows for quicker decision-making regarding data management and usage, fosters collaboration throughout the organization, and ensures users can access the necessary answers promptly. Furthermore, gaining transparency into your organization's data ecosystem empowers you to monitor critical data that aligns with your key business objectives, ultimately enhancing strategic initiatives and fostering growth. -
14
Apache Atlas
Apache Software Foundation
Atlas serves as a versatile and scalable suite of essential governance services, empowering organizations to efficiently comply with regulations within the Hadoop ecosystem while facilitating integration across the enterprise's data landscape. Apache Atlas offers comprehensive metadata management and governance tools that assist businesses in creating a detailed catalog of their data assets, effectively classifying and managing these assets, and fostering collaboration among data scientists, analysts, and governance teams. It comes equipped with pre-defined types for a variety of both Hadoop and non-Hadoop metadata, alongside the capability to establish new metadata types tailored to specific needs. These types can incorporate primitive attributes, complex attributes, and object references, and they can also inherit characteristics from other types. Entities, which are instances of these types, encapsulate the specifics of metadata objects and their interconnections. Additionally, REST APIs enable seamless interaction with types and instances, promoting easier integration and enhancing overall functionality. This robust framework not only streamlines governance processes but also supports a culture of data-driven collaboration across the organization. -
15
Collibra
Collibra
The Collibra Data Intelligence Cloud serves as your comprehensive platform for engaging with data, featuring an exceptional catalog, adaptable governance, ongoing quality assurance, and integrated privacy measures. Empower your teams with a premier data catalog that seamlessly merges governance, privacy, and quality controls. Elevate efficiency by enabling teams to swiftly discover, comprehend, and access data from various sources, business applications, BI, and data science tools all within a unified hub. Protect your data's privacy by centralizing, automating, and streamlining workflows that foster collaboration, implement privacy measures, and comply with international regulations. Explore the complete narrative of your data with Collibra Data Lineage, which automatically delineates the connections between systems, applications, and reports, providing a contextually rich perspective throughout the organization. Focus on the most critical data while maintaining confidence in its relevance, completeness, and reliability, ensuring that your organization thrives in a data-driven world. By leveraging these capabilities, you can transform your data management practices and drive better decision-making across the board. -
16
Ataccama ONE
Ataccama
Ataccama is a revolutionary way to manage data and create enterprise value. Ataccama unifies Data Governance, Data Quality and Master Data Management into one AI-powered fabric that can be used in hybrid and cloud environments. This gives your business and data teams unprecedented speed and security while ensuring trust, security and governance of your data. -
17
Dawiso
Dawiso
$49 per user per monthDawiso is a comprehensive platform designed to simplify data management by integrating governance with usability for the entire organization. Central to Dawiso is its AI-powered data catalog, which empowers teams to quickly discover and understand trusted data across various systems, reports, and business applications. The platform’s flexible governance capabilities, alongside intuitive documentation apps, make it easy for both technical and non-technical users to collaborate effectively. Dawiso increases confidence in data through visual data lineage that clearly maps connections and dependencies across sources and systems. It supports regulatory compliance with customizable workflows, role-based access controls, and detailed metadata capture. By providing business-friendly tools and structured governance, Dawiso bridges communication gaps and streamlines data-driven decision-making. The platform promotes transparency, security, and usability in data management. Overall, Dawiso is built to enhance collaboration and trust in organizational data assets. -
18
OvalEdge, a cost-effective data catalogue, is designed to provide end-to-end data governance and privacy compliance. It also provides fast, reliable analytics. OvalEdge crawls the databases, BI platforms and data lakes of your organization to create an easy-to use, smart inventory. Analysts can quickly discover data and provide powerful insights using OvalEdge. OvalEdge's extensive functionality allows users to improve data access, data literacy and data quality.
-
19
Secuvy AI
Secuvy
Secuvy, a next-generation cloud platform, automates data security, privacy compliance, and governance via AI-driven workflows. Unstructured data is treated with the best data intelligence. Secuvy, a next-generation cloud platform that automates data security, privacy compliance, and governance via AI-driven workflows is called Secuvy. Unstructured data is treated with the best data intelligence. Automated data discovery, customizable subjects access requests, user validations and data maps & workflows to comply with privacy regulations such as the ccpa or gdpr. Data intelligence is used to locate sensitive and private information in multiple data stores, both in motion and at rest. Our mission is to assist organizations in protecting their brand, automating processes, and improving customer trust in a world that is rapidly changing. We want to reduce human effort, costs and errors in handling sensitive data. -
20
Zaloni Arena
Zaloni
An agile platform for end-to-end DataOps that not only enhances but also protects your data assets is available through Arena, the leading augmented data management solution. With our dynamic data catalog, users can enrich and access data independently, facilitating efficient management of intricate data landscapes. Tailored workflows enhance the precision and dependability of every dataset, while machine learning identifies and aligns master data assets to facilitate superior decision-making. Comprehensive lineage tracking, accompanied by intricate visualizations and advanced security measures like masking and tokenization, ensures utmost protection. Our platform simplifies data management by cataloging data from any location, with flexible connections that allow analytics to integrate seamlessly with your chosen tools. Additionally, our software effectively addresses the challenges of data sprawl, driving success in business and analytics while offering essential controls and adaptability in today’s diverse, multi-cloud data environments. As organizations increasingly rely on data, Arena stands out as a vital partner in navigating this complexity. -
21
ThinkData Works
ThinkData Works
ThinkData Works provides a robust catalog platform for discovering, managing, and sharing data from both internal and external sources. Enrichment solutions combine partner data with your existing datasets to produce uniquely valuable assets that can be shared across your entire organization. The ThinkData Works platform and enrichment solutions make data teams more efficient, improve project outcomes, replace multiple existing tech solutions, and provide you with a competitive advantage. -
22
Accurity
Accurity
Accurity serves as a comprehensive data intelligence platform that fosters a deep, organization-wide comprehension and unwavering confidence in your data, enabling you to accelerate essential decision-making processes, enhance revenue streams, cut down on expenses, and maintain compliance with data regulations. By harnessing timely, pertinent, and precise data, you can effectively meet and engage your customers, thereby amplifying your brand visibility and increasing sales conversions. With a unified interface, automated quality assessments, and structured workflows for data quality issues, you can significantly reduce both personnel and infrastructure expenses, allowing you to focus on leveraging your data rather than merely managing it. Uncover genuine value within your data by identifying and eliminating inefficiencies, refining your decision-making strategies, and uncovering impactful product and customer insights that can propel your company’s innovative initiatives forward. Ultimately, Accurity empowers businesses to transform their data into a strategic asset that drives growth and fosters a competitive edge. -
23
erwin Data Intelligence
Quest Software
$299 per monthErwin Data Intelligence (erwin DI) integrates data cataloging and data literacy functions to enhance awareness and accessibility of data assets while providing guidance for their usage and establishing safeguards to uphold data policies and best practices. It automatically gathers, transforms, and compiles metadata from a diverse range of data sources, business applications, operational processes, and data models into a centralized catalog. This catalog is then rendered accessible and comprehensible through role-specific, contextual views, empowering stakeholders to make informed strategic decisions based on reliable insights. Furthermore, erwin DI promotes enterprise data governance and supports digital transformation initiatives, alongside any endeavors that depend on data for successful results. It allows for the scheduling of regular metadata scans from a wide variety of data sources, simplifying the mapping of data elements from their origin to their destination, including data in transit, while facilitating seamless data integration across different platforms. In addition, it enables data consumers to identify and explore data that is pertinent to their specific roles, thereby enhancing overall data engagement within the organization. Ultimately, erwin DI serves as a powerful tool for maximizing the value derived from data assets. -
24
Datameer
Datameer
Datameer is your go-to data tool for exploring, preparing, visualizing, and cataloging Snowflake insights. From exploring raw datasets to driving business decisions – an all-in-one tool. -
25
Talend Data Catalog
Qlik
Talend Data Catalog provides your organization with a single point of control for all your data. Data Catalog provides robust tools for search, discovery, and connectors that allow you to extract metadata from almost any data source. It makes it easy to manage your data pipelines, protect your data, and accelerate your ETL process. Data Catalog automatically crawls, profiles and links all your metadata. Data Catalog automatically documents up to 80% of the data associated with it. Smart relationships and machine learning keep the data current and up-to-date, ensuring that the user has the most recent data. Data governance can be made a team sport by providing a single point of control that allows you to collaborate to improve data accessibility and accuracy. With intelligent data lineage tracking and compliance tracking, you can support data privacy and regulatory compliance. -
26
Y42
Datos-Intelligence GmbH
Y42 is the first fully managed Modern DataOps Cloud for production-ready data pipelines on top of Google BigQuery and Snowflake. -
27
Privacera
Privacera
Multi-cloud data security with a single pane of glass Industry's first SaaS access governance solution. Cloud is fragmented and data is scattered across different systems. Sensitive data is difficult to access and control due to limited visibility. Complex data onboarding hinders data scientist productivity. Data governance across services can be manual and fragmented. It can be time-consuming to securely move data to the cloud. Maximize visibility and assess the risk of sensitive data distributed across multiple cloud service providers. One system that enables you to manage multiple cloud services' data policies in a single place. Support RTBF, GDPR and other compliance requests across multiple cloud service providers. Securely move data to the cloud and enable Apache Ranger compliance policies. It is easier and quicker to transform sensitive data across multiple cloud databases and analytical platforms using one integrated system. -
28
DvSum, an AI-powered Data Intelligence platform, makes it remarkably easy for data and analytics teams discover, monitor, and govern data. DvSum uses powerful AI-enabled algorithms to automatically catalog, classify, and curate your data and make it available as a Data Catalog. DvSum Data Intelligence will help you propel your enterprise towards its digital- and analytics-enabled transformation goals.
-
29
DataOps.live
DataOps.live
Create a scalable architecture that treats data products as first-class citizens. Automate and repurpose data products. Enable compliance and robust data governance. Control the costs of your data products and pipelines for Snowflake. This global pharmaceutical giant's data product teams can benefit from next-generation analytics using self-service data and analytics infrastructure that includes Snowflake and other tools that use a data mesh approach. The DataOps.live platform allows them to organize and benefit from next generation analytics. DataOps is a unique way for development teams to work together around data in order to achieve rapid results and improve customer service. Data warehousing has never been paired with agility. DataOps is able to change all of this. Governance of data assets is crucial, but it can be a barrier to agility. Dataops enables agility and increases governance. DataOps does not refer to technology; it is a way of thinking. -
30
Global IDs
Global IDs
Explore the exceptional features offered by Global IDs, which provide a comprehensive range of Enterprise Data Solutions including data governance, compliance, cloud migration, rationalization, privacy, analytics, and more. The Global IDs EDA Platform includes essential functionalities such as automated discovery and profiling, data classification, data lineage, and data quality, all aimed at ensuring that data is transparent, reliable, and understandable throughout the ecosystem. Additionally, the architecture of the Global IDs EDA platform is built for seamless integration, enabling access to all its functionalities through APIs. This platform effectively automates data management for organizations of varying sizes and diverse data environments. By utilizing Global IDs EDA, businesses can significantly enhance their data management practices and drive better decision-making. -
31
MetaCenter
Data Advantage Group
MetaCenter allows both business and technology departments to systematically organize and categorize the information resources of an organization. Through this platform, users can independently inquire about their data assets and understand the movement of data within the business, as well as determine the appropriate usage classifications. This capability helps organizations minimize expenses while enhancing flexibility and mitigating operational risks. The search-based semantic layer facilitates the automatic cross-referencing of models. Additionally, tailored Faceted Views for specific data assets can be made accessible to distinct roles within the organization. The lower total cost of ownership combined with increased automation results in a superior return on investment when compared to rival solutions. Moreover, the intuitive, GUI-driven customization allows for quick application adaptations without the need for programming expertise or professional services, making it highly user-friendly. Ultimately, this empowers teams to respond swiftly to changing business needs and enhances overall productivity. -
32
Dataedo
Dataedo
$49 per monthUncover, record, and oversee your metadata effectively. Dataedo features a range of automated metadata scanners designed to interface with different database technologies, where they extract data structures and metadata to populate your metadata repository. With just a few clicks, you can create a comprehensive catalog of your data while detailing each component. Clarify table and column names with user-friendly aliases, and enrich your understanding of data assets by adding descriptions and custom fields defined by users. Leverage sample data to gain insights into the contents of your data assets, allowing you to grasp the information better prior to utilization and ensuring its quality. Maintain high data standards through data profiling techniques. Facilitate widespread access to data knowledge across your organization. Enhance data literacy, democratize data access, and empower all members of your organization to leverage data more effectively with a simple on-premises data catalog solution. Strengthening data literacy through a well-structured data catalog will ultimately lead to improved decision-making processes. -
33
Tree Schema Data Catalog
Tree Schema
$99 per monthThis is the essential tool for metadata management. In just 5 minutes, automatically populate your entire catalogue! Data Discovery. Data Discovery. Find the data you need from any part of your data ecosystem, starting with the database and ending with the specific values for each field. Automated documentation of your data from existing data storage. First-class support for unstructured and tabular data. Automated data governance actions. Data Lineage. Data Lineage. Explore your data lineage to understand where your data is coming from and where it is headed. View the impact analysis of changes. See all up- and downstream impacts. Visualize connections and relationships. API AccessNew. Tree Schema API allows you to manage your data lineage in code and keep your catalog current. Integrate Data Lineage in CICD pipelines Capture values & description within your code Analyze the impact of breaking changes. Data Dictionary. Know the key terms and lingo which drive your business. Define the context and scope of keywords -
34
Kylo
Teradata
Kylo serves as an open-source platform designed for effective management of enterprise-level data lakes, facilitating self-service data ingestion and preparation while also incorporating robust metadata management, governance, security, and best practices derived from Think Big's extensive experience with over 150 big data implementation projects. It allows users to perform self-service data ingestion complemented by features for data cleansing, validation, and automatic profiling. Users can manipulate data effortlessly using visual SQL and an interactive transformation interface that is easy to navigate. The platform enables users to search and explore both data and metadata, examine data lineage, and access profiling statistics. Additionally, it provides tools to monitor the health of data feeds and services within the data lake, allowing users to track service level agreements (SLAs) and address performance issues effectively. Users can also create batch or streaming pipeline templates using Apache NiFi and register them with Kylo, thereby empowering self-service capabilities. Despite organizations investing substantial engineering resources to transfer data into Hadoop, they often face challenges in maintaining governance and ensuring data quality, but Kylo significantly eases the data ingestion process by allowing data owners to take control through its intuitive guided user interface. This innovative approach not only enhances operational efficiency but also fosters a culture of data ownership within organizations. -
35
Rocket Data Intelligence
Rocket Software
A metadata management and data lineage platform for hybrid enterprises whose data spans mainframe, distributed, and cloud. It automatically discovers datasets, pipelines, dependencies, and transformations, then provides end-to-end lineage and impact analysis so teams can trace a KPI to its source, predict what will break before changing a job/table, and prove where sensitive fields (PII) flowed. Key capabilities: • Automated metadata collection across heterogeneous platforms. • Lineage mapping from source through ETL/ELT, warehouse/lakehouse, and BI. • Impact analysis and change visibility. • Field/column-level tracing (where supported) for audits, root-cause analysis, and compliance. • Glossary/tagging to connect technical assets to business definitions and ownership. Outcome: fewer production surprises, faster modernization, and more trusted analytics/AI backed by audit-ready evidence. Partner with us to unlock actionable insights and modernize your data strategy today. -
36
Catalog
Coalesce
$699 per monthCastor serves as a comprehensive data catalog aimed at facilitating widespread use throughout an entire organization. It provides a holistic view of your data ecosystem, allowing you to swiftly search for information using its robust search capabilities. Transitioning to a new data framework and accessing necessary data becomes effortless. This approach transcends conventional data catalogs by integrating various data sources, thereby ensuring a unified truth. With an engaging and automated documentation process, Castor simplifies the task of establishing trust in your data. Within minutes, users can visualize column-level, cross-system data lineage. Gain an overarching perspective of your data pipelines to enhance confidence in your data integrity. This tool enables users to address data challenges, conduct impact assessments, and ensure GDPR compliance all in one platform. Additionally, it helps in optimizing performance, costs, compliance, and security associated with your data management. By utilizing our automated infrastructure monitoring system, you can ensure the ongoing health of your data stack while streamlining data governance practices. -
37
Validio
Validio
Examine the usage of your data assets, focusing on aspects like popularity, utilization, and schema coverage. Gain vital insights into your data assets, including their quality and usage metrics. You can easily locate and filter the necessary data by leveraging metadata tags and descriptions. Additionally, these insights will help you drive data governance and establish clear ownership within your organization. By implementing a streamlined lineage from data lakes to warehouses, you can enhance collaboration and accountability. An automatically generated field-level lineage map provides a comprehensive view of your entire data ecosystem. Moreover, anomaly detection systems adapt by learning from your data trends and seasonal variations, ensuring automatic backfilling with historical data. Thresholds driven by machine learning are specifically tailored for each data segment, relying on actual data rather than just metadata to ensure accuracy and relevance. This holistic approach empowers organizations to better manage their data landscape effectively. -
38
Alex Solutions
Alex Solutions
The Alex Platform serves as the definitive source of data and business accuracy for your organization. It is a crucial element that underpins our clients' success in utilizing data effectively. From the very first day of its deployment, Alex is engineered to simplify operations and deliver value right from the start. The Alex Augmented Data Catalog harnesses top-tier machine learning technology, swiftly creating a cohesive data environment across the entire enterprise. Regardless of the intricacies of your technical framework, Alex Data Lineage enables you to effortlessly track and comprehend your data movements in a secure and automated manner. In an increasingly interconnected world, global teams require seamless coordination. The Alex Intelligent Business Glossary boasts an attractive user interface and comprehensive features, making it ideal for fostering international collaboration. By consolidating all definitions, policies, metrics, rules, and workflows, you can effectively tackle the challenges posed by multi-cloud environments and global enterprises. This approach empowers robust data governance initiatives, ensuring that all aspects of data management are consistently aligned across the organization. Ultimately, Alex not only streamlines operations but also enhances the overall strategic decision-making process. -
39
Blindata
Blindata
$1000/year/ user Blindata encompasses all the essential components of a comprehensive Data Governance program. Its features, including the Business Glossary, Data Catalog, and Data Lineage, work together to provide a cohesive and thorough perspective on your data. The Data Classification module assigns semantic significance to the data, while the Data Quality, Issue Management, and Data Stewardship modules enhance data reliability and foster trust. Additionally, specific functionalities for privacy compliance are available, such as a registry for processing activities, centralized management of privacy notes, and a consent registry that incorporates Blockchain for notarization. The Blindata Agent facilitates connections to various data sources, enabling the collection of metadata, including data structures like Tables, Views, and Fields, as well as data quality metrics and reverse lineage. With a modular design and fully API-driven architecture, Blindata supports seamless integration with vital business systems, including DBMS, Active Directory, e-commerce platforms, and various Data Platforms. This versatile solution can be deployed as a Software as a Service (SaaS), installed on-premises, or acquired through the AWS Marketplace, making it accessible for a wide range of organizational needs. Its flexibility ensures that businesses can tailor their Data Governance approach to meet specific requirements effectively. -
40
Acryl Data
Acryl Data
Bid farewell to abandoned data catalogs. Acryl Cloud accelerates time-to-value by implementing Shift Left methodologies for data producers and providing an easy-to-navigate interface for data consumers. It enables the continuous monitoring of data quality incidents in real-time, automating anomaly detection to avert disruptions and facilitating swift resolutions when issues arise. With support for both push-based and pull-based metadata ingestion, Acryl Cloud simplifies maintenance, ensuring that information remains reliable, current, and authoritative. Data should be actionable and operational. Move past mere visibility and leverage automated Metadata Tests to consistently reveal data insights and identify new opportunities for enhancement. Additionally, enhance clarity and speed up resolutions with defined asset ownership, automatic detection, streamlined notifications, and temporal lineage for tracing the origins of issues while fostering a culture of proactive data management. -
41
Unity Catalog
Databricks
The Unity Catalog from Databricks stands out as the sole comprehensive and open governance framework tailored for data and artificial intelligence, integrated within the Databricks Data Intelligence Platform. This innovative solution enables organizations to effortlessly manage structured and unstructured data in various formats, in addition to machine learning models, notebooks, dashboards, and files on any cloud or platform. Data scientists, analysts, and engineers can securely navigate, access, and collaborate on reliable data and AI resources across diverse environments, harnessing AI capabilities to enhance efficiency and realize the full potential of the lakehouse architecture. By adopting this cohesive and open governance strategy, organizations can foster interoperability and expedite their data and AI projects, all while making regulatory compliance easier to achieve. Furthermore, users can quickly identify and categorize both structured and unstructured data, including machine learning models, notebooks, dashboards, and files, across all cloud platforms, ensuring a streamlined governance experience. This comprehensive approach not only simplifies data management but also encourages a collaborative culture among teams. -
42
Effortlessly monitor thousands of tables through machine learning-driven anomaly detection alongside a suite of over 50 tailored metrics. Ensure comprehensive oversight of both data and metadata while meticulously mapping all asset dependencies from ingestion to business intelligence. This solution enhances productivity and fosters collaboration between data engineers and consumers. Sifflet integrates smoothly with your existing data sources and tools, functioning on platforms like AWS, Google Cloud Platform, and Microsoft Azure. Maintain vigilance over your data's health and promptly notify your team when quality standards are not satisfied. With just a few clicks, you can establish essential coverage for all your tables. Additionally, you can customize the frequency of checks, their importance, and specific notifications simultaneously. Utilize machine learning-driven protocols to identify any data anomalies with no initial setup required. Every rule is supported by a unique model that adapts based on historical data and user input. You can also enhance automated processes by utilizing a library of over 50 templates applicable to any asset, thereby streamlining your monitoring efforts even further. This approach not only simplifies data management but also empowers teams to respond proactively to potential issues.
-
43
Collate
Collate
FreeCollate is a metadata platform powered by AI that equips data teams with automated tools for discovery, observability, quality, and governance, utilizing agent-based workflows for efficiency. It is constructed on the foundation of OpenMetadata and features a cohesive metadata graph, providing over 90 seamless connectors for gathering metadata from various sources like databases, data warehouses, BI tools, and data pipelines. This platform not only offers detailed column-level lineage and data profiling but also implements no-code quality tests to ensure data integrity. The AI agents play a crucial role in streamlining processes such as data discovery, permission-sensitive querying, alert notifications, and incident management workflows on a large scale. Furthermore, the platform includes real-time dashboards, interactive analyses, and a shared business glossary that cater to both technical and non-technical users, facilitating the management of high-quality data assets. Additionally, its continuous monitoring and governance automation help uphold compliance with regulations such as GDPR and CCPA, which significantly minimizes the time taken to resolve data-related issues and reduces the overall cost of ownership. This comprehensive approach to data management not only enhances operational efficiency but also fosters a culture of data stewardship across the organization. -
44
Select Star
Select Star
$270 per monthIn just 15 minutes, you can set up your automated data catalogue and receive column-level lines, Entity Relationship diagrams, and auto-populated documentation in 24 hours. You can easily tag, find, and add documentation to data so everyone can find the right one for them. Select Star automatically detects your column-level data lineage and displays it. Now you can trust the data by knowing where it came. Select Star automatically displays how your company uses data. This allows you to identify relevant data fields without having to ask anyone else. Select Star ensures that your data is protected with AICPA SOC2 Security, Confidentiality and Availability standards. -
45
Foundational
Foundational
Detect and address code and optimization challenges in real-time, mitigate data incidents before deployment, and oversee data-affecting code modifications comprehensively—from the operational database to the user interface dashboard. With automated, column-level data lineage tracing the journey from the operational database to the reporting layer, every dependency is meticulously examined. Foundational automates the enforcement of data contracts by scrutinizing each repository in both upstream and downstream directions, directly from the source code. Leverage Foundational to proactively uncover code and data-related issues, prevent potential problems, and establish necessary controls and guardrails. Moreover, implementing Foundational can be achieved in mere minutes without necessitating any alterations to the existing codebase, making it an efficient solution for organizations. This streamlined setup promotes quicker response times to data governance challenges.