High data quality helps data analysts get deeper insights from BI and analytics tools. Zuse drastically reduces the time and effort required to prepare your data.
Run automated data pipelines from various sources to Amazon Redshift, Microsoft Azure SQL and other popular cloud data warehouses.
The effectiveness of a machine learning model depends largely on the quality of the data used to train it. Zuse helps data scientists with data cleansing at scale, without any coding.
Migrate data between business apps using Zuse. Cleanse, enrich & stream your data across data stores at scale and rapidly, with de-duplication & real-time data integrity checks.
Omni-Connect
Connect to multiple sources
Eliminate data silos and connect to a variety of data sources including files, feeds, cloud storage, databases, warehouses and business applications. Zuse is suited for 50+ data sources for on-prem and cloud with built-in connectors to import data.
Pre-built for Data Quality
Improve data quality
Benefit from intelligent suggestions to unlock true potential of data - with no-code data cleansing engine & pre-built transformations. Automatically identify data types, get suggestions for datasets to identify and fix invalid data. Understand data better with the help of widgets such as value distribution, value statistics, text patterns, outliers and more.
Transform & Enrich Data
Transform & enrich
Transform data without coding. Highlight required data for accurate suggestions to extract, count, replace and split data. Format and change your data using more than 250 transforms. Reshape data using unpivot, pivot, and summary transforms. Blend data from a variety of sources using join and append transforms.
Automated Data Workflow
Automate workflows
Schedule & run source-to-destination data pipelines to transport data across multiple sources & sync to various destinations fast. Monitor data quality and get alerted for drops. Use rulesets to track the changes made to your data and reuse rulesets across datasets.
Intuitive Data Organization
Cataloging and governance
Built-in capabilities to help classify, catalog, and govern data. Discover data easily using system-wide search capabilities. Add metadata in datasets and workspaces using tags and improve searchability & filtering. Find relevant details about datasets and workspaces in a single pane.
Security, Privacy & Compliance
Security, privacy & compliance
Collaborate securely with role-based access, Track datasets when shared and exported & ensure privacy by masking sensitive data. Mark and secure sensitive data in datasets. Control access and prevent unauthorized exports. Share workspaces with users and groups and set role-based access controls. Track datasets when shared & exported and verify if security measures were applied to protect sensitive data.
Eliminate data silos and connect to a variety of data sources including files, feeds, cloud storage, databases, warehouses and business applications. Zuse is suited for 50+ data sources for on-prem and cloud with built-in connectors to import data.
Transformations Library
Data Quality Monitor
Streaming Pipelines
Collate scattered data
Anywhere-to-anywhere data transfer
Extensive data format coverage
Multiple Source and destination types - files, cloud storage, databases
Data on Data
No-Code & On-Tap
Suited for 50+ data sources
Benefit from intelligent suggestions to unlock true potential of data - with no-code data cleansing engine & pre-built transformations. Automatically identify data types, get suggestions for datasets to identify and fix invalid data. Understand data better with the help of widgets such as value distribution, value statistics, text patterns, outliers and more.
Transformations Library
Data Quality Monitor
Streaming Pipelines
Pre-built data transformations
No-code data cleansing
Intelligent data-cleansing suggestions
predict data fixes based on format and structures
Quick inputs on data outliers and anamolies
Data on Data
No-Code & On-Tap
Suited for 50+ data sources
Transform data without coding. Highlight required data for accurate suggestions to extract, count, replace and split data. Format and change your data using more than 250 transforms. Reshape data using unpivot, pivot, and summary transforms. Blend data from a variety of sources using join and append transforms.
Transformations Library
Data Quality Monitor
Streaming Pipelines
Streamline data transformation
Interactive data prep wizard
Easily format and reshape data with preloaded transforms
Custom transformation builder
OpenAI powered - Natural language transformation instructions
Data on Data
No-Code & On-Tap
Suited for 50+ data sources
Schedule & run source-to-destination data pipelines to transport data across multiple sources & sync to various destinations fast. Monitor data quality and get alerted for drops. Use rulesets to track the changes made to your data and reuse rulesets across datasets.
Transformations Library
Data Quality Monitor
Streaming Pipelines
Source-to-destination data pipeline
Self-service scheduler
Data quality monitors and instant alerts
Define rulesets once and run on autopilot
Trace and audit trail of changes to data
Data on Data
No-Code & On-Tap
Suited for 50+ data sources
Built-in capabilities to help classify, catalog, and govern data. Discover data easily using system-wide search capabilities. Add metadata in datasets and workspaces using tags and improve searchability & filtering. Find relevant details about datasets and workspaces in a single pane.
Transformations Library
Data Quality Monitor
Streaming Pipelines
Classify and Catalog data
Metadata for improved discovery of data
Govern data export and sharing criteria
Tags for improved searchability and filters
Data on Data
No-Code & On-Tap
Suited for 50+ data sources
Collaborate securely with role-based access, Track datasets when shared and exported & ensure privacy by masking sensitive data. Mark and secure sensitive data in datasets. Control access and prevent unauthorized exports. Share workspaces with users and groups and set role-based access controls. Track datasets when shared & exported and verify if security measures were applied to protect sensitive data.
Transformations Library
Data Quality Monitor
Streaming Pipelines
Role based access
Sensitive data masking and tokenizing
Control access and actions on sensitive data
Audit trail of actions and changes to sensitive data
Encryption in-transit and at-rest
Privacy and Data protection compliant
Data on Data
No-Code & On-Tap
Suited for 50+ data sources
1. What is Zuse?
Zuse is an advanced self-service data preparation tool that helps organizations model, cleanse, prepare, enrich and organize large volumes of data from multiple data sources to serve data analytics and data warehousing with exceptional data quality, all without the need for any coding.
2. Can I get a quick walk through session of Zuse?
Yes, please request a personalized demo by mailing us at connect@kairhos.com.
3. What information is collected and how is it used?
We do not access any data that you upload to Zuse. All data is encrypted at our data centers. We only collect the basic information on how you use the product and the features most used so that we can enhance and make them better. We assure you that we do not share this information externally and use this data only for our internal evaluation.
4. How many imports and exports am I allowed to schedule at a time in Zuse?
You can schedule upto 100 data imports and 100 data exports at a time in Zuse by configuring the schedule details, setting the required frequency and import or export the data to any of the supported data sources.
5. What is the maximum number of columns that I can create within a dataset?
In Zuse, we have enabled it to incorporate upto 333 columns within a single dataset. This allows you to prepare large amounts of data with ease.
6. Can I import files of any size in Zuse?
You can import JSON files of a maximum size of 20 MB and import other supported files upto a size of 100 MB. The supported file types are CSV, TSV, JSON, HTML, XLS, XLSX and XML.
7. How many files and tables can I import in Zuse at a time?
We support the import of upto 10 files or tables at a time in Zuse. When multiple files are imported, you will be taken to the workspace details page where all the datasets that are imported will be displayed.
1. Zuse Overview
2. Getting Started
2.1 Setting up your organization
2.2 Home Page
2.3 Entities in Zuse
2.4 Workspace details page
3. Data Import
3.1 Add new dataset
3.2 Import data from local files
3.3 Import data from local databases
3.4 Import data from cloud databases
3.5 Import data from FTP servers
3.6 Import data from URLs
3.7 Import data from Google BigQuery
3.8 Saved data connections
3.9 Schedule import
3.10 Reload data
4. Data Transformation
4.1 Zuse Studio
4.2 Column transformations
4.3 Dataset transforms
4.4 Datatypes
5. Data Export
5.1 Export options
5.2 Local files
5.3 Export data to FTP
5.4 Cloud Storage Services
5.5 Cloud databases
5.6 Export data to Google BigQuery
5.8 Schedule export
5.9 Processing history and processed data
6. Sharing & Collaboration
6.1 Workspaces
6.2 Datasets
6.3 Ruleset templates
6.4 Roles and permissions
6.5 Data cataloguing
7. Privacy & Compliance
7.1 Encryption at Zuse
7.2 GDPR Compliance in Zuse
7.3 HIPAA Compliance in Zuse
8. FAQs
GDPR Compliance in Zuse
Features that Zuse offers with respect to GDPR compliance:
· Marking Personal Data
· Encryption at Rest
· Enhanced Data Privacy and Security
· Right to Data Portability
· Right to Erase and Forget Data
· Zuse's GDPR Compliance and Privacy Policy
Field Level Encryption
The sensitive data you input into the application, or the sensitive service data, is stored in respective Zuse service database. Data in these are encrypted according to AES 256 standard with AES/CBC/PKCS5Padding mode. The data that is encrypted at rest varies with the services you opt for about the data we encrypt in our services.
There are two types of field level encryption-
· Depending on the sensitivity of data
· Depending on the search functionality
File encryption
The files you create or attach are saved into our Distributed File System (DFS). The files that are encrypted at rest varies with the services you opt for about the data we encrypt in our services. The encryption happens at application layer and only the authorized application user will be able to view the data.
URL encryption
Anything that is identifiable in the URL, say the ID of a document, that part is encrypted.
This encryption has two types- One key per Org or One key per feature. Again, this is decided by the sensitivity of data in the URL.
Backup encryption
We backup data frequently, and our backup servers are equipped with the same standard of protection as the main servers. All data we take as backup will be encrypted at rest.
Encryption of Logs
Zuse Logs uses Hadoop Distributed File System (HDFS) to store and manage logs. We use the encryption technology of Hadoop to encrypt the data while key management is handled by our KMS.
Encryption of cache
We use Redis open-source software for storing and managing cache data. If any data contains sensitive personal information, we choose to encrypt it.
Key Management
Our in-house Key Management Service (KMS) creates, stores and manages keys across all services. We own and maintain the keys using KMS. Currently, we do not have the provision to encrypt data with keys owned by the customer.
Full-disk encryption
We employ self-encrypting drives (SEDs) to support hardware-based full disk encryption. An SED is a hard disk drive (HDD) or a solid-state drive (SSD) that has an encryption circuit built into it.