Skip to content

Data Collectors

Most customers will have an on premise data collector to connect to data sources that are on premise.

Most customers will also have a cloud data collector to connect to SaaS/cloud data sources such as Qualys, CrowdStrike or Carbon Black.

This guide covers how to create and manage Data Source Credentials and Data Sources in Glue.


Data Source Credentials store authentication information that can be used by Data Sources to connect to external systems securely.

  1. Navigate to Data Collectors from the main menu
  2. Click on Credentials in the breadcrumb navigation

Note: This feature requires GlueAdmin role access.

  1. Click the Create button in the top-right corner of the page

  2. A modal window will appear with the following fields:

    FieldDescriptionRequired
    NameUnique identifier for the credentialYes
    DescriptionDetails about what this credential is used forNo
    UsernameThe username for authenticationYes
    PasswordThe password for authentication (securely stored)Yes
    EnabledWhether this credential is active and available for useYes
  3. Fill in all required fields

  4. Click Save to create the credential

  5. You’ll see a success notification confirming the credential was created

You can edit credentials directly from the grid or through the edit dialog:

  1. Locate the credential in the grid
  2. Click the Edit (pencil icon) button in the command column
  3. Make your changes in the popup editor
  4. Click Save to confirm or Cancel to discard changes
  1. Click on the credential’s Name link in the grid
  2. This will take you to the detailed view where you can make changes
  3. Save your changes

Important: You cannot change passwords through inline editing for security reasons.

  1. Locate the credential you want to delete in the grid
  2. Click the Delete (trash icon) button in the command column
  3. Confirm the deletion when prompted
  4. The credential will be permanently removed

Warning: Deleting a credential that is in use by active Data Sources may cause those sources to fail. Verify no sources are using this credential before deletion.

  1. Click the Export button with the Excel icon in the grid toolbar
  2. The credentials list will be exported to an Excel file
  3. Note: Passwords are not included in exports for security reasons

The Data Source Credentials grid supports:

  • Sorting: Click column headers to sort
  • Filtering: Use the column menu to filter data
  • Column Reordering: Drag column headers to rearrange
  • Column Resizing: Drag column borders to resize
  • Pagination: Navigate through pages at the bottom of the grid

Data Sources define the connections between Data Collectors and external systems that provide data.

  1. Navigate to Data Collectors from the main menu
  2. Select a specific Data Collector
  3. Click on the Data Sources tab
  1. Click the Create New Data Source button in the top-right corner

  2. A modal window will appear with configuration options:

    FieldDescriptionRequired
    NameDescriptive name for the data sourceYes
    TypeThe type of data source (e.g., Active Directory, Intune, etc. Use Generic if there is no specific type)Yes
    DescriptionDetails about what data this source providesNo
    EnabledWhether this data source is activeYes
    Additional FieldsType-specific configuration fieldsVaries
  3. Complete all required fields

  4. Configure any additional fields specific to your data source type

  5. Click Save to create the data source

  6. A success notification will confirm creation

  1. Locate the data source in the grid
  2. Click the Update (pencil icon) button in the command column
  3. The edit modal will open with current values
  4. Make your desired changes
  5. Click Save to update or Close to cancel

A unique, descriptive identifier for your data source. Choose names that clearly indicate the source’s purpose.

The category of data source. This determines what configuration options are available. Choose the specific type for the source you are setting up or otherwise use Generic.

Optional but recommended. Explain what data this source provides and any important details about its configuration.

Read-only field showing when the data source last successfully synchronized data.

Controls whether the Data Collector will attempt to use this source. Disable sources for:

  • Temporary maintenance
  • Testing purposes
  • Deprecated sources you’re not ready to delete

Read-only field indicating the current state:

  • Active: Successfully connected and functioning
  • Error: Connection or synchronization issues
  • Unknown: Not yet tested or status unavailable

Data Sources are processed in order. To change the execution order:

  1. Click and hold on any row in the grid
  2. Drag the row to its new position
  3. Drop it either:
    • Before another row (drop above)
    • After another row (drop below)
  4. The system will automatically save the new order
  5. All affected sources will be reordered accordingly

Tip: Order matters when sources have dependencies. Ensure sources that provide data to other sources are ordered first.

The Data Sources grid supports:

  • Filtering: Use the filter menu on columns to find specific sources
  • Sorting: Click column headers to sort by any field
  • Drag & Drop Reordering: Rearrange execution order visually
  • Column Resizing: Adjust column widths as needed
  • Column Reordering: Rearrange columns by dragging headers

  1. Use Descriptive Names: Include the system name and environment (e.g., “ProdSQL-HRDatabase”)
  2. Keep Descriptions Current: Document what systems use each credential
  3. Regular Audits: Review and remove unused credentials quarterly
  4. Principle of Least Privilege: Use accounts with minimum necessary permissions
  5. Disable Before Delete: Disable credentials first to identify dependent sources
  1. Clear Naming Conventions: Use consistent naming patterns across your organization
  2. Complete Descriptions: Document data content, refresh schedules, and dependencies
  3. Test Before Enabling: Create new sources as disabled, test thoroughly, then enable
  4. Monitor Status: Regularly check the Status column for errors
  5. Logical Ordering: Group related sources together in the execution order
  6. Document Dependencies: Note in descriptions if a source depends on data from another