[DRAFT] AIO PR #2641
Conversation
(cherry picked from commit 01d4159)
Co-authored-by: Patryk Czajka <patryk.czajka@snowflake.com>
(cherry picked from commit 54906d6)
Co-authored-by: Patryk Czajka <patryk.czajka@snowflake.com>
…2631) The failure was caused by a boto PythonDeprecationWarning. To avoid if/else logic for checking boto availability, I decided to check the suffixes of the warnings instead of their types.
…re for impersonation (#2510)
Co-authored-by: Peter Mansour <peter.mansour@snowflake.com>
Co-authored-by: Peter Mansour <peter.mansour@snowflake.com>
…n pandas.to_sql anyway
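The commit note above describes filtering warnings by the tail of their message text rather than by their class, so the test suite does not need to import boto just to reference the warning type. A minimal sketch of that idea, with made-up suffixes and names that are not the connector's actual test code:

```python
import warnings

# Illustrative suffix; the real suite would list the boto deprecation text.
IGNORED_SUFFIXES = ("is no longer supported by the Python core team.",)

def unexpected_warnings(caught):
    """Keep only warnings whose message does not end with an ignored suffix."""
    return [
        w for w in caught
        if not str(w.message).endswith(IGNORED_SUFFIXES)
    ]

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    warnings.warn(
        "Python 3.8 is no longer supported by the Python core team.",
        DeprecationWarning,
    )
    warnings.warn("something actually unexpected", UserWarning)

leftover = unexpected_warnings(caught)
print([str(w.message) for w in leftover])
```

Because `str.endswith` accepts a tuple, new ignorable messages can be added without any type checks or conditional imports.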
.github/workflows/build_test.yml
Outdated
```diff
-    name: Test FIPS linux-3.8-${{ matrix.cloud-provider }}
+    name: Test FIPS linux-3.9-${{ matrix.cloud-provider }}
     needs: build
     runs-on: ubuntu-latest
     strategy:
       fail-fast: false
       matrix:
         cloud-provider: [aws]
     steps:
       - uses: actions/checkout@v4
       - name: Setup parameters file
         shell: bash
         env:
           PARAMETERS_SECRET: ${{ secrets.PARAMETERS_SECRET }}
         run: |
           gpg --quiet --batch --yes --decrypt --passphrase="$PARAMETERS_SECRET" \
             .github/workflows/parameters/public/parameters_${{ matrix.cloud-provider }}.py.gpg > test/parameters.py
       - name: Setup private key file
         shell: bash
         env:
           PYTHON_PRIVATE_KEY_SECRET: ${{ secrets.PYTHON_PRIVATE_KEY_SECRET }}
         run: |
           gpg --quiet --batch --yes --decrypt --passphrase="$PYTHON_PRIVATE_KEY_SECRET" \
             .github/workflows/parameters/public/rsa_keys/rsa_key_python_${{ matrix.cloud-provider }}.p8.gpg > test/rsa_key_python_${{ matrix.cloud-provider }}.p8
       - name: Download wheel(s)
         uses: actions/download-artifact@v4
         with:
-          name: manylinux_x86_64_py3.8
+          name: manylinux_x86_64_py3.9
           path: dist
       - name: Show wheels downloaded
         run: ls -lh dist
         shell: bash
       - name: Run tests
         run: ./ci/test_fips_docker.sh
         env:
-          PYTHON_VERSION: 3.8
+          PYTHON_VERSION: 3.9
           cloud_provider: ${{ matrix.cloud-provider }}
           PYTEST_ADDOPTS: --color=yes --tb=short
           TOX_PARALLEL_NO_SPINNER: 1
         shell: bash
       - uses: actions/upload-artifact@v4
         with:
           include-hidden-files: true
-          name: coverage_linux-fips-3.8-${{ matrix.cloud-provider }}
+          name: coverage_linux-fips-3.9-${{ matrix.cloud-provider }}
           path: |
             .coverage
             coverage.xml
       - uses: actions/upload-artifact@v4
         with:
           include-hidden-files: true
           name: junit_linux-fips-3.9-${{ matrix.cloud-provider }}
           path: |
             junit.*.xml
   test-lambda:
     name: Test Lambda linux-${{ matrix.python-version }}-${{ matrix.cloud-provider }}
     needs: build
     runs-on: ubuntu-latest
     strategy:
       fail-fast: false
       matrix:
         python-version: ["3.8", "3.9", "3.10", "3.11", "3.12"]
         # TODO: temporarily reduce number of jobs: SNOW-2311643
         # python-version: ["3.9", "3.10", "3.11", "3.12", "3.13"]
```
Check warning (Code scanning / CodeQL): Workflow does not contain permissions (Medium)
Copilot Autofix
The best way to fix the problem is to add a `permissions` block at the root of the workflow file (.github/workflows/build_test.yml), after the workflow `name` and before the jobs are defined. This block should minimally specify `contents: read`, so the default `GITHUB_TOKEN` provided to all jobs can only read repository contents, not write or modify anything. If any job requires elevated permissions, those can be granted individually at the job level, but based on the provided snippets, none of the jobs in this workflow appear to need more than `contents: read`.
Specifically, add:

```yaml
permissions:
  contents: read
```

directly after the workflow name at the top of .github/workflows/build_test.yml. No imports or definitions are required; just the YAML block as described.
```diff
@@ -1,4 +1,6 @@
 name: Build and Test
+permissions:
+  contents: read
 
 on:
   push:
```
```yaml
        python-version: ["3.13"]
        cloud-provider: [aws, azure, gcp]
    steps:
      - uses: actions/checkout@v4
      - name: Set up Python
        uses: actions/setup-python@v4
        with:
          python-version: ${{ matrix.python-version }}
      - name: Display Python version
        run: python -c "import sys; print(sys.version)"
      - name: Set up Java
        uses: actions/setup-java@v4 # for wiremock
        with:
          java-version: 11
          distribution: 'temurin'
          java-package: 'jre'
      - name: Fetch Wiremock
        shell: bash
        run: curl https://repo1.maven.org/maven2/org/wiremock/wiremock-standalone/3.11.0/wiremock-standalone-3.11.0.jar --output .wiremock/wiremock-standalone.jar
      - name: Setup parameters file
        shell: bash
        env:
          PARAMETERS_SECRET: ${{ secrets.PARAMETERS_SECRET }}
        run: |
          gpg --quiet --batch --yes --decrypt --passphrase="$PARAMETERS_SECRET" \
            .github/workflows/parameters/public/parameters_${{ matrix.cloud-provider }}.py.gpg > test/parameters.py
      - name: Setup private key file
        shell: bash
        env:
          PYTHON_PRIVATE_KEY_SECRET: ${{ secrets.PYTHON_PRIVATE_KEY_SECRET }}
        run: |
          gpg --quiet --batch --yes --decrypt --passphrase="$PYTHON_PRIVATE_KEY_SECRET" \
            .github/workflows/parameters/public/rsa_keys/rsa_key_python_${{ matrix.cloud-provider }}.p8.gpg > test/rsa_key_python_${{ matrix.cloud-provider }}.p8
      - name: Download wheel(s)
        uses: actions/download-artifact@v4
        with:
          name: ${{ matrix.os.download_name }}_py${{ matrix.python-version }}
          path: dist
      - name: Show wheels downloaded
        run: ls -lh dist
        shell: bash
      - name: Upgrade setuptools, pip and wheel
        run: python -m pip install -U setuptools pip wheel
      - name: Install tox
        run: python -m pip install tox>=4
      - name: Run tests
        run: python -m tox run -e aio
        env:
          PYTHON_VERSION: ${{ matrix.python-version }}
          cloud_provider: ${{ matrix.cloud-provider }}
          PYTEST_ADDOPTS: --color=yes --tb=short
          TOX_PARALLEL_NO_SPINNER: 1
        shell: bash
      - name: Combine coverages
        run: python -m tox run -e coverage --skip-missing-interpreters false
        shell: bash
      - uses: actions/upload-artifact@v4
        with:
          name: coverage_aio_${{ matrix.os.download_name }}-${{ matrix.python-version }}-${{ matrix.cloud-provider }}
          path: |
            .tox/.coverage
            .tox/coverage.xml
  test-unsupporeted-aio:
    name: Test unsupported asyncio ${{ matrix.os.download_name }}-${{ matrix.python-version }}
    runs-on: ${{ matrix.os.image_name }}
    strategy:
      fail-fast: false
      matrix:
        os:
          - image_name: ubuntu-latest
            download_name: manylinux_x86_64
        python-version: [ "3.9", ]
    steps:
      - uses: actions/checkout@v4
      - name: Set up Python
        uses: actions/setup-python@v4
        with:
          python-version: ${{ matrix.python-version }}
```
Check warning (Code scanning / CodeQL): Workflow does not contain permissions (Medium)
Copilot Autofix
To fix this issue, the `permissions:` key should be set explicitly for the test-aio job, or globally at the workflow level if all jobs share the same minimal needs. For jobs that only need to check out code and upload/download artifacts (such as test-aio), setting `contents: read` is sufficient and recommended. This prevents `GITHUB_TOKEN` from having broader write privileges. The explicit block should look like:

```yaml
permissions:
  contents: read
```

The change should be made within .github/workflows/build_test.yml, either globally (above `jobs:`) or locally in the definition of the `test-aio:` job. If other jobs in the workflow might require different permissions, setting it per job is safer; otherwise, a global block is preferred for consistency. Since CodeQL specifically flagged line 449 (the start of the `test-aio` job), the precise minimal fix is to add `permissions: contents: read` to the job definition immediately below `test-aio:` (i.e., after line 449). No additional dependencies or code changes are required.
```diff
@@ -447,6 +447,8 @@
 
   test-aio:
     name: Test asyncio ${{ matrix.os.download_name }}-${{ matrix.python-version }}-${{ matrix.cloud-provider }}
+    permissions:
+      contents: read
     needs: build
     runs-on: ${{ matrix.os.image_name }}
     strategy:
```
```python
{
    k: v if k in AUTHENTICATION_REQUEST_KEY_WHITELIST else "******"
    for (k, v) in body["data"].items()
},
```
Check failure (Code scanning / CodeQL): Clear-text logging of sensitive information (High). Multiple expressions log sensitive data (password).
Copilot Autofix
To fix the risk of cleartext logging of sensitive information, we must guarantee that sensitive keys (such as "PASSCODE", "password", etc.) are never logged. The best approach is:

- Maintain a separate set of sensitive keys (e.g., `SENSITIVE_AUTH_KEYS`) that are explicitly redacted regardless of the whitelist.
- When constructing the log message, always redact the value for any key in `SENSITIVE_AUTH_KEYS`, even if it is also present in the whitelist.
- Double-check that `PASSCODE` (and similar sensitive keys) are always redacted.
- Importantly, make this redaction explicit and simple, so future maintainers cannot mistakenly expose sensitive values by merely editing the whitelist.

Implementation plan:

- Define a set (or list) called `SENSITIVE_AUTH_KEYS`, including `"PASSCODE"`, `"password"`, and any similar key.
- In the log message dictionary comprehension, for each key:
  - If the key is in `SENSITIVE_AUTH_KEYS`, log `"******"`.
  - Else if the key is in the existing whitelist, log its value.
  - Else, log `"******"`.
- Edit only the relevant log statement and local definitions in src/snowflake/connector/aio/auth/_auth.py.
- No external library is needed; use built-in Python set/dict logic.
```diff
@@ -43,6 +43,8 @@
 
 logger = logging.getLogger(__name__)
 
+# Always redact these keys, even if they are included in any whitelist
+SENSITIVE_AUTH_KEYS = {"PASSCODE", "PASSWORD", "password", "passcode"}
 
 class Auth(AuthSync):
     async def authenticate(
@@ -146,7 +148,7 @@
         logger.debug(
             "body['data']: %s",
             {
-                k: v if k in AUTHENTICATION_REQUEST_KEY_WHITELIST else "******"
+                k: "******" if k in SENSITIVE_AUTH_KEYS else (v if k in AUTHENTICATION_REQUEST_KEY_WHITELIST else "******")
                 for (k, v) in body["data"].items()
             },
         )
```
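As a standalone illustration of the double-redaction pattern in that suggestion: sensitive keys are masked even when the whitelist would allow them. The whitelist contents and the payload below are stand-ins, not the connector's real values.

```python
# Keys that must never appear in logs, regardless of any whitelist.
SENSITIVE_AUTH_KEYS = {"PASSCODE", "PASSWORD", "password", "passcode"}
# Illustrative whitelist; deliberately overlaps with the sensitive set.
AUTHENTICATION_REQUEST_KEY_WHITELIST = {"ACCOUNT_NAME", "LOGIN_NAME", "PASSCODE"}

def redact_for_logging(data: dict) -> dict:
    """Redact sensitive keys first, then fall back to the whitelist."""
    return {
        k: "******"
        if k in SENSITIVE_AUTH_KEYS
        else (v if k in AUTHENTICATION_REQUEST_KEY_WHITELIST else "******")
        for k, v in data.items()
    }

body = {"ACCOUNT_NAME": "my_account", "PASSCODE": "123456", "TOKEN": "abc"}
print(redact_for_logging(body))
# → {'ACCOUNT_NAME': 'my_account', 'PASSCODE': '******', 'TOKEN': '******'}
```

Checking the sensitive set before the whitelist is what makes the redaction robust to future whitelist edits.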
```python
received_redirected_request = input(
    "Enter the URL the OAuth flow redirected you to: "
)
code, state = self._parse_authorization_redirected_request(
```
Check failure (Code scanning / CodeQL): Clear-text logging of sensitive information (High). Multiple expressions log sensitive data (password, secret).
Copilot Autofix
To fix the issue, the code should avoid displaying the OAuth URL with sensitive parameters in cleartext. Instead, before printing the URL for manual use, sensitive parameters embedded in the URL (notably `client_id`, and potentially authorization codes or tokens if present) should be redacted or masked.
The optimal solution is to parse the URL, locate known sensitive query parameters (`client_id`, `client_secret`, possibly `code` and `token`), and replace their values with placeholders (e.g., `'***'`) before printing.
Edit only the affected region in src/snowflake/connector/auth/oauth_code.py where the URL is printed in `_ask_authorization_callback_from_user`, adding a small helper method for redaction and changing the print to output the sanitized version.
```diff
@@ -367,7 +367,7 @@
                 "We were unable to open a browser window for you, "
                 "please open the URL manually then paste the "
                 "URL you are redirected to into the terminal:\n"
-                f"{authorization_request}"
+                f"{self._redact_sensitive_url_params(authorization_request)}"
             )
             received_redirected_request = input(
                 "Enter the URL the OAuth flow redirected you to: "
@@ -389,6 +389,22 @@
         )
         return code, state
 
+    @staticmethod
+    def _redact_sensitive_url_params(url: str) -> str:
+        """Redact sensitive OAuth query parameters from URL."""
+        parsed_url = urllib.parse.urlparse(url)
+        params = urllib.parse.parse_qs(parsed_url.query, keep_blank_values=True)
+        # Redact known sensitive parameters
+        sensitive_keys = {"client_id", "client_secret", "code", "token"}
+        for key in sensitive_keys:
+            if key in params:
+                params[key] = ["***"]
+        redacted_query = urllib.parse.urlencode(params, doseq=True)
+        redacted_url = urllib.parse.urlunparse(
+            parsed_url._replace(query=redacted_query)
+        )
+        return redacted_url
     def _parse_authorization_redirected_request(
         self,
         url: str,
```
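The helper in that suggestion can be exercised standalone with only the standard library. The sensitive parameter names mirror the autofix; the sample URL is made up for illustration (note that `urlencode` may percent-encode the `***` placeholder, which is harmless for display purposes).

```python
import urllib.parse

# Parameter names considered sensitive, per the suggestion above.
SENSITIVE_QUERY_KEYS = {"client_id", "client_secret", "code", "token"}

def redact_sensitive_url_params(url: str) -> str:
    """Replace sensitive OAuth query-parameter values with a placeholder."""
    parsed = urllib.parse.urlparse(url)
    params = urllib.parse.parse_qs(parsed.query, keep_blank_values=True)
    for key in SENSITIVE_QUERY_KEYS:
        if key in params:
            params[key] = ["***"]
    redacted_query = urllib.parse.urlencode(params, doseq=True)
    return urllib.parse.urlunparse(parsed._replace(query=redacted_query))

url = "https://auth.example.com/authorize?client_id=abc123&state=xyz&code=s3cret"
print(redact_sensitive_url_params(url))
```

Parsing and re-serializing the query string, rather than string-replacing, keeps non-sensitive parameters such as `state` intact.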
```python
c[0][1].startswith(
    "https://test-account.snowflakecomputing.com:443"
)
```
Check failure (Code scanning / CodeQL): Incomplete URL substring sanitization (High, test): https://test-account.snowflakecomputing.com:443
Copilot Autofix
The best way to fix this issue is to parse the potentially returned URL (i.e., `c[0][1]`) using Python's `urllib.parse.urlparse` and then check the individual fields (scheme, host, port) for correctness. This avoids the pitfalls of "starts with" logic, which can be fooled by userinfo and other URL fields, and directly asserts correct URL composition. Specifically, for each call in `mocked_fetch.call_args_list`, the argument at `[0][1]` should be parsed as a URL, and the test should assert that the scheme is "https", the host is "test-account.snowflakecomputing.com", and the port is 443 (the absence of an explicit port in the netloc is also acceptable if the library omits the default 443).
This requires adding `from urllib.parse import urlparse` if it is not already imported in this file.
The assertion should thus change from a `startswith` check to an explicit dissection and validation of the URL fields.
```diff
@@ -22,6 +22,7 @@
 
 import snowflake.connector.aio
 from snowflake.connector import DatabaseError, OperationalError, ProgrammingError
+from urllib.parse import urlparse
 from snowflake.connector.aio import SnowflakeConnection
 from snowflake.connector.aio._description import CLIENT_NAME
 from snowflake.connector.compat import IS_WINDOWS
@@ -721,8 +722,12 @@
         )  # Skip tear down, there's only a mocked rest api
         assert any(
             [
-                c[0][1].startswith(
-                    "https://test-account.snowflakecomputing.com:443"
+                (
+                    (lambda u: (
+                        u.scheme == "https"
+                        and u.hostname == "test-account.snowflakecomputing.com"
+                        and (u.port == 443 or (u.port is None and u.netloc.endswith(":443")))
+                    ))(urlparse(c[0][1]))
                 )
                 for c in mocked_fetch.call_args_list
             ]
```
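To see why CodeQL flags the plain `startswith` check: a hostile URL can begin with the trusted prefix yet resolve to a different host, e.g. by smuggling the prefix into the userinfo part of the netloc. The URLs below are illustrative, not taken from the real test suite.

```python
from urllib.parse import urlparse

TRUSTED_HOST = "test-account.snowflakecomputing.com"

def is_trusted(url: str) -> bool:
    """Validate scheme, hostname, and port explicitly instead of startswith."""
    u = urlparse(url)
    return u.scheme == "https" and u.hostname == TRUSTED_HOST and u.port in (443, None)

good = "https://test-account.snowflakecomputing.com:443/session/v1"
# Begins with the trusted prefix, but everything before '@' is userinfo,
# so the actual host is evil.com.
evil = "https://test-account.snowflakecomputing.com:443@evil.com/steal"

print(good.startswith("https://test-account.snowflakecomputing.com:443"))  # True
print(evil.startswith("https://test-account.snowflakecomputing.com:443"))  # True - fooled
print(is_trusted(good), is_trusted(evil))
```

Field-by-field validation via `urlparse` accepts the legitimate URL and rejects the userinfo trick that defeats the substring check.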
```python
# azure_request_present = False
expected_token_prefix = "sig="
for line in caplog.text.splitlines():
    if "blob.core.windows.net" in line and expected_token_prefix in line:
```
Check failure (Code scanning / CodeQL): Incomplete URL substring sanitization (High, test): blob.core.windows.net
Copilot Autofix
To fix the problem, the code should parse potential URLs in log lines and verify that any host matches (or ends with) 'blob.core.windows.net', rather than checking for this substring anywhere in the log line. The best way is to extract candidate URLs from each log line with the `re` module, parse out the host portion with `urllib.parse.urlparse`, and check whether any host matches or ends with 'blob.core.windows.net'. Only those lines are subjected to further checks about `expected_token_prefix`. This keeps the test intent intact while avoiding the false positives inherent to pure substring checks.

Required changes:

- Add `import re` at the top (if not present).
- Add `from urllib.parse import urlparse` at the top (if not present).
- Refactor the for-loop on log lines:
  - Extract all candidate URLs from the line using a regex.
  - For each URL, parse it and check if its host matches or ends with 'blob.core.windows.net'.
  - Only run the sensitive-information assertion on such lines.

All changes are localized to the test file, specifically to the lines near the existing substring check.
```diff
@@ -10,6 +10,8 @@
 import sys
 import time
 from logging import getLogger
+import re
+from urllib.parse import urlparse
 
 import pytest
 
@@ -92,14 +94,17 @@
     # azure_request_present = False
     expected_token_prefix = "sig="
     for line in caplog.text.splitlines():
-        if "blob.core.windows.net" in line and expected_token_prefix in line:
-            # azure_request_present = True
-            # getattr is used to stay compatible with old driver - before SECRET_STARRED_MASK_STR was added
-            assert (
-                expected_token_prefix
-                + getattr(SecretDetector, "SECRET_STARRED_MASK_STR", "****")
-                in line
-            ), "connectionpool logger is leaking sensitive information"
+        # Find all potential URLs in the line
+        urls = re.findall(r'(https?://[^\s\'"<>]+)', line)
+        for url in urls:
+            host = urlparse(url).hostname
+            if host and host.endswith("blob.core.windows.net") and expected_token_prefix in url:
+                # getattr is used to stay compatible with old driver - before SECRET_STARRED_MASK_STR was added
+                assert (
+                    expected_token_prefix
+                    + getattr(SecretDetector, "SECRET_STARRED_MASK_STR", "****")
+                    in url
+                ), "connectionpool logger is leaking sensitive information"
 
 # TODO: disable the check for now - SNOW-2311540
 # assert (
```
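A minimal standalone sketch of that log-scanning approach: pull candidate URLs out of free-form log text, then trust only hostname matches. The sample log lines are made up for illustration.

```python
import re
from urllib.parse import urlparse

# Rough URL matcher for free-form log text, as in the autofix above.
URL_RE = re.compile(r'https?://[^\s\'"<>]+')

def azure_blob_urls(text: str) -> list:
    """Return URLs whose parsed hostname ends with blob.core.windows.net."""
    matches = []
    for url in URL_RE.findall(text):
        host = urlparse(url).hostname
        if host and host.endswith("blob.core.windows.net"):
            matches.append(url)
    return matches

log = (
    "GET https://acct.blob.core.windows.net/stage?sig=****\n"
    "mention of blob.core.windows.net in plain text is ignored\n"
    "GET https://blob.core.windows.net.evil.com/x?sig=leak\n"
)
print(azure_blob_urls(log))
# → ['https://acct.blob.core.windows.net/stage?sig=****']
```

Only the URL whose parsed hostname actually ends with the Azure Blob domain is returned; the bare substring mention and the look-alike host are both skipped.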
```python
private_key = rsa.generate_private_key(
    public_exponent=65537, key_size=1024, backend=default_backend()
)
```
Check failure (Code scanning / CodeQL): Use of weak cryptographic key (High, test): 1024
Copilot Autofix
AI 7 days ago
To fix the security issue, the RSA key generated on line 146 should be at least 2048 bits, per current recommendations. Change the line
private_key = rsa.generate_private_key(
public_exponent=65537, key_size=1024, backend=default_backend()
)so that key_size=2048 (or higher). This change preserves the intent and functionality of the test code with a secure key size. No other code modifications are necessary, and no additional imports or libraries are required since the rest of the key creation process remains unaffected.
```diff
@@ -144,7 +144,7 @@
 def create_x509_cert(hash_algorithm):
     # Generate a private key
     private_key = rsa.generate_private_key(
-        public_exponent=65537, key_size=1024, backend=default_backend()
+        public_exponent=65537, key_size=2048, backend=default_backend()
     )
 
     # Generate a public key
```
Please answer these questions before submitting your pull requests. Thanks!
What GitHub issue is this PR addressing? Make sure that there is an accompanying issue to your PR.
Fixes #NNNN
Fill out the following pre-review checklist:
Please describe how your code solves the related issue.
Please write a short description of how your code change solves the related issue.
(Optional) PR for stored-proc connector: