SNOW-1232488 Column names are converted to lowercase #157

RoyalTS · 2020-02-19T09:16:08Z

All column names, irrespective of their casing in the Snowflake database, seem to be converted to lower-case.

What version of Python are you using (python --version)?

3.7.3

What operating system and processor architecture are you using (python -c 'import platform; print(platform.platform())')?

Linux-4.15.0-1060-aws-x86_64-with-debian-buster-sid

What are the component versions in the environment (pip list)?

pandas                          0.25.3
snowflake-connector-python       2.1.3
snowflake-sqlalchemy             1.2.0

What did you do?

import sqlalchemy as sa
from snowflake.sqlalchemy import URL

engine = sa.create_engine(URL(...))

res = engine.execute('''
SELECT
NULL::TEXT AS COL1,
NULL::TEXT AS "col2",
NULL::TEXT AS "COL3"
''')
row = res.fetchone()
row.keys()

What did you expect to see?

['COL1', 'col2', 'COL3']

(These are the column names I get when I fetch the query result using the Snowflake Connector's fetch_pandas_all().)

What did you see instead?

['col1', 'col2', 'col3']


RoyalTS · 2020-03-10T08:29:01Z

@keller00 I see you self-assigned this. Does that mean you're working on this? If so, got a rough timeline?

keller00 · 2020-03-10T16:48:15Z

> @keller00 I see you self-assigned this. Does that mean you're working on this? If so, got a rough timeline?

I will when I find some time; I'm just trying to let you know that I have looked at it. But if you need this in a short time frame, then I'd encourage you to open a PR. This helps us out immensely, even if we end up fixing bugs in the PR submitted.

This whole issue comes from the fact that if you double-quote an identifier in Snowflake then it is case sensitive, while an unquoted identifier is converted to upper case by default.
Fixing this would change the behaviour of the connector and would need to be announced ahead of time.
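
A minimal sketch of the quoting rule described above, assuming Snowflake's documented behaviour that identifiers are quoted with double quotes and that unquoted identifiers are folded to upper case. The function name `snowflake_stored_name` is hypothetical and only models the rule; it does not call Snowflake.

```python
def snowflake_stored_name(identifier: str) -> str:
    """Return the name Snowflake would store for an identifier as written in SQL.

    Hypothetical helper for illustration only; it models the rule, nothing more.
    """
    if len(identifier) >= 2 and identifier.startswith('"') and identifier.endswith('"'):
        # Double-quoted: case is preserved. A doubled quote inside the
        # identifier stands for a single literal quote character.
        return identifier[1:-1].replace('""', '"')
    # Unquoted: folded to upper case.
    return identifier.upper()


for written in ['COL1', 'col1', '"col2"', '"COL3"', '"Col4"']:
    print(f"{written:8} -> {snowflake_stored_name(written)}")
```

Under this rule, COL1 and col1 name the same column, while "col2" and "COL2" would be two different ones.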

patrickmcdougle-okta · 2021-03-26T04:14:18Z

This causes issues with tools like Apache Superset that use introspection and column name matching to have shared filters across multiple visualizations. Sometimes the columns are full uppercase and sometimes they aren't.

rareal · 2021-07-10T10:59:56Z

Hi, I'm sorry for the annoying question, but are there any updates on this one?

patrickhowerter · 2021-08-27T20:27:41Z

Wouldn't it make more sense for it to be uppercase? Snowflake defaults to returning column names as uppercase.

struck89 · 2021-09-06T14:41:08Z

Same issue here, but if you mix upper case and lower case it obeys:
Try:
SELECT NULL::TEXT AS COL1, NULL::TEXT AS "col2", NULL::TEXT AS "COL3", NULL::TEXT AS "Col4"

Edit: Found this: #73

It is my understanding that this can be closed, as the correct cases can be found from the cursor description.

Continuing with the code given above:
res.cursor.description

[('COL1', 2, None, 16777216, None, None, True),
 ('col2', 2, None, 16777216, None, None, True),
 ('COL3', 2, None, 16777216, None, None, True),
 ('Col4', 2, None, 16777216, None, None, True)]

Instead of using pandas (pd.read_sql) I just updated my code to run the following:

res = snow.execute(sql_stmt)
columns = [line[0] for line in res.cursor.description]
df = pd.DataFrame(res, columns = columns)
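
The same workaround can be wrapped in a small helper. This is only a sketch: `column_names` is a hypothetical name, and the sample description below is copied from the output shown above rather than fetched from a live cursor.

```python
from typing import Any, List, Sequence, Tuple


def column_names(description: Sequence[Tuple[Any, ...]]) -> List[str]:
    """Extract column names from a DB-API cursor description.

    The first element of each description entry is the column name
    exactly as the database returned it, so no case folding happens here.
    """
    return [column[0] for column in description]


# Sample data copied from the cursor description printed earlier in this comment.
description = [
    ('COL1', 2, None, 16777216, None, None, True),
    ('col2', 2, None, 16777216, None, None, True),
    ('COL3', 2, None, 16777216, None, None, True),
    ('Col4', 2, None, 16777216, None, None, True),
]

print(column_names(description))  # ['COL1', 'col2', 'COL3', 'Col4']
```

With a real result object this would be called as column_names(res.cursor.description) before building the DataFrame.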

patrickhowerter · 2021-09-28T20:00:48Z

@keller00 @sfc-gh-abhatnagar Do you think you can assign this to someone else that works on the project? It's a pretty difficult bug to work around.

fflakito · 2021-11-29T17:33:31Z

+1!
I'd be very much interested in seeing a fix for this.

cpcloud · 2023-03-15T14:19:14Z

Since this behavior is fundamentally broken, we had to work around this in ibis in ibis-project/ibis#5741.

If you use ibis, we do the minimum amount of munging needed to make things work, and we preserve the quoting behavior of Snowflake instead of the broken behavior in snowflake-sqlalchemy.

seb-kro · 2023-04-24T11:36:12Z

We are also maintaining internal workarounds for the broken case handling. It would be very helpful to have this fixed inside snowflake-sqlalchemy itself, especially since it is such a fundamental issue that has been known for years and should be rather straightforward to fix. It even seems that snowflake.sqlalchemy.snowdialect.SnowflakeDialect already has a reasonable implementation of normalize_name and denormalize_name, but denormalize_name in particular just doesn't seem to be applied when needed.
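
To illustrate why a normalize/denormalize pair can lose information, here is a deliberately simplified sketch of the general convention SQLAlchemy uses for databases that fold unquoted names to upper case. It is not the actual snowflake-sqlalchemy code: the function names `normalize_name_sketch` and `denormalize_name_sketch` are hypothetical, and the real implementations also handle quoting flags and reserved words, which are omitted here.

```python
def normalize_name_sketch(name: str) -> str:
    """Database-side name -> Python-side name (simplified assumption).

    An all-upper-case name is treated as case insensitive and shown in lower case.
    """
    if name.upper() == name:
        return name.lower()
    return name


def denormalize_name_sketch(name: str) -> str:
    """Python-side name -> database-side name (simplified assumption).

    An all-lower-case name is treated as case insensitive and sent in upper case.
    """
    if name.lower() == name:
        return name.upper()
    return name


for db_name in ['COL1', 'col2', 'COL3', 'Col4']:
    seen = normalize_name_sketch(db_name)
    back = denormalize_name_sketch(seen)
    print(f"{db_name} -> {seen} -> {back}")
```

In this sketch COL1 and COL3 come back as col1 and col3, matching the keys reported at the top of the issue, and Col4 is left alone. The lower-case quoted name col2 is indistinguishable from a normalized COL2 once the quoting information is dropped, so converting it back yields COL2.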

This issue even extends to snowflake.connector, where e.g. some table names are simply converted to uppercase (connector.pandas_tools.pd_writer). So you need to maintain a workaround there as well if you want to support mixed case handling.

write_pandas(
        conn=sf_connection,
        df=df,
        # Note: Our sqlalchemy connector creates tables case insensitively
        table_name=table.name.upper(),
        schema=table.schema,
        **kwargs,
)

sfc-gh-dszmolka · 2024-03-13T11:40:41Z

hi folks - apologies for leaving this unanswered for so long; we're changing that going forward.
for now, possibly

might be all originating from the same gap in snowflake-sqlalchemy. At this time, I cannot promise any timeline for taking care of this, but rest assured we're aware of the issue and i'll keep this thread posted.

Catmktg-Bob-Jia · 2024-12-18T04:35:40Z

In case anyone else runs into the same confusion, it looks like:

If the column name has "_" as a prefix, the Pandas DataFrame column name will be all upper case, regardless of the casing in the Snowflake SQL.
If the column name does NOT have "_" as a prefix, the Pandas DataFrame column name will be all lower case, regardless of the casing in the Snowflake SQL.

.read_sql( 'SELECT 1 AS col' )['col'] --> pass
.read_sql( 'SELECT 1 AS col' )['COL'] --> fail
.read_sql( 'SELECT 1 AS COL' )['col'] --> pass
.read_sql( 'SELECT 1 AS COL' )['COL'] --> fail

.read_sql( 'SELECT 1 AS _col' )['_col'] --> fail
.read_sql( 'SELECT 1 AS _col' )['_COL'] --> pass
.read_sql( 'SELECT 1 AS _COL' )['_col'] --> fail
.read_sql( 'SELECT 1 AS _COL' )['_COL'] --> pass
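
One way to stay robust against the inconsistency in the table above is to look columns up case insensitively instead of relying on the returned casing. This is a sketch under the assumption that no two columns in the result differ only by case; `resolve_column` is a hypothetical helper, not part of pandas or the Snowflake libraries.

```python
from typing import Iterable


def resolve_column(columns: Iterable[str], wanted: str) -> str:
    """Return the actual column name matching `wanted`, ignoring case.

    Raises KeyError when there is no match or when the match is ambiguous,
    so that silently picking the wrong column is not possible.
    """
    matches = [c for c in columns if c.lower() == wanted.lower()]
    if len(matches) != 1:
        raise KeyError(f"{wanted!r} matched {len(matches)} columns: {matches}")
    return matches[0]


# Column names as they came back in the observations above.
returned = ['col', '_COL']
print(resolve_column(returned, 'COL'))   # col
print(resolve_column(returned, '_col'))  # _COL
```

With a DataFrame this could be used as df[resolve_column(df.columns, 'COL')].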

keller00 self-assigned this Feb 19, 2020

rumbin mentioned this issue Jan 19, 2022

Snowflake: Inconsistent column name case apache/superset#18085

Closed

3 tasks

kaxil mentioned this issue Oct 13, 2022

Disable the SDK from changing capitalization of output columns by default astronomer/astro-sdk#917

Closed

cpcloud mentioned this issue Mar 14, 2023

fix(snowflake): ensure that quoting matches snowflake behavior, not sqlalchemy ibis-project/ibis#5741

Merged

kklein mentioned this issue May 31, 2023

Snowflake column name case-sensitive Quantco/datajudge#148

Open

Kilo59 mentioned this issue Aug 15, 2023

[BUGFIX] Snowflake column identifiers great-expectations/great_expectations#8526

Merged

4 tasks

Kilo59 mentioned this issue Sep 5, 2023

[BUGFIX] Fix Snowflake column identifier issue when running checkpoint great-expectations/great_expectations#8630

Closed

4 tasks

Kilo59 mentioned this issue Sep 15, 2023

[BUGFIX] Snowflake column name case sensitivity great-expectations/great_expectations#8719

Merged

3 tasks

anthonyburdi mentioned this issue Oct 10, 2023

[BUGFIX] Column Descriptive Metrics: Convert table name to lowercase for snowflake great-expectations/great_expectations#8817

Merged

4 tasks

sfc-gh-dszmolka assigned sfc-gh-dszmolka and unassigned keller00 Mar 13, 2024

sfc-gh-dszmolka added the status-triage (Issue is under initial triage) label Mar 13, 2024

sfc-gh-dszmolka changed the title ~~Column names are converted to lowercase~~ SNOW-1232488 Column names are converted to lowercase Mar 13, 2024

sfc-gh-dszmolka added the bug (Something isn't working) and status-triage_done (Initial triage done, will be further handled by the driver team) labels and removed the status-triage (Issue is under initial triage) label Mar 13, 2024

This was referenced Mar 13, 2024

SNOW-1232488: Table reflection methods using .format do not respect quoted_name #276

Open

SNOW-1232488: metadata reflection fails for case sensitive (lower/mixed) case objects #388

Open
