Connecting to External Data Sources
Last updated July 29, 2024
In Mostly AI, you can seamlessly connect to various external data sources, bringing your valuable data into the platform for analysis, model training, and prediction. This connectivity empowers you to leverage a wide range of data assets, both structured and unstructured, to enhance your machine learning projects.
Supported Data Sources
- Databases:
- Relational Databases (SQL): Connect to popular SQL databases like MySQL, PostgreSQL, and SQL Server to access structured data.
- NoSQL Databases: Integrate with NoSQL databases such as MongoDB and Cassandra for flexible data structures.
- Cloud Storage:
- Amazon S3: Access data stored in Amazon S3 buckets.
- Google Cloud Storage: Connect to Google Cloud Storage for data retrieval.
- Azure Blob Storage: Integrate with Azure Blob Storage for data access and storage.
- Files:
- CSV: Import data from comma-separated value files.
- Excel: Load data from Excel spreadsheets.
- JSON: Access data in JSON format.
- Other formats: Support for various other common data file formats.
- APIs:
- REST APIs: Connect to web services and APIs to retrieve real-time data or updates.
- Custom APIs: Create custom API connections to integrate with specific systems or applications.
Connecting to External Data
- Navigate to the "Data" Section: Go to the "Data" section within your project.
- Choose "Connect to Data Source": Select the option to connect to an external data source.
- Select Data Source Type: Specify the type of data source (database, cloud storage, file, API) you want to connect to.
- Enter Connection Information: Provide the necessary credentials and connection details for your chosen data source.
- Test the Connection: Verify that the connection is established correctly before proceeding.
- Select Data: Choose the table, file, or endpoint that holds the data you want to import.
- Import Data: Initiate the data import process.
Benefits of Connecting to External Data
- Data Integration: Bring diverse data from different sources into a centralized platform for analysis and modeling.
- Real-Time Data Access: Retrieve real-time data updates from APIs or streaming sources to enhance model accuracy and responsiveness.
- Data Enrichment: Combine data from multiple sources to create comprehensive datasets for more insightful analysis.
- Automated Processes: Integrate external data sources into your automated workflows for continuous data ingestion and processing.
By connecting to external data sources, you can expand your machine learning capabilities within Mostly AI, enabling you to leverage a wider range of data assets for more powerful insights and informed decision-making.
Was this article helpful?