<#C069CQNHDNF|> I am facing this issue while testi...
# troubleshooting
h
#C069CQNHDNF I am facing this issue while testing tap-s3-csv for both: • Steve Clarke (default):second_place_medal: • Wise and there is no Integer field in .csv file please let me know how we can fix it.
👀 1
c
Can you post your config?
h
means meltano.yml?
This is my config
@Conner Panarella (SpaceCondor) please let me know if you have any solution
c
Can you send the full meltano.yml? Removing any secrets you may have
h
This is the complete details for extractor
but please let me know if you need whole meltano.yml
c
can you try quoting the start_date?
h
no still same error
c
It is hard to debug without all the info, but I would try quoting the values of the config just to see if that works. If all the plugins report the same error with the config, it sounds like something with that
h
can we connect on zoom?
c
I'm not available right now, but did you try quoting the values?
h
Yes tried quoting all values
but not worked
It worked for me but I did this change on installed library csv.field_size_limit(sys.maxsize) commented this code in tap_s3_csv/sync.py
e
There's a huge difference between the default and
sys.maxsize
though:
Copy code
>>> csv.field_size_limit()
131072
>>> sys.maxsize
922337203685477580
Can you confirm that your CSV has a field with more than 131072 bytes?
h
I don't have any field whose size is more than this default max size.
🤔 1
can I get solution for this?
c
Can you test with another CSV file just to ensure it's not a problem with the config?
h
I tested it with multiple files
but faced same issue
c
Even simple ones?
h
Yes even with simple one- where only one column containing country names.
c
Can you provide that one?
h
population.csv
population1.csv
c
Can you send the config after you quoted the values?
h
image.png
s
I had no problem ingesting both those files.
Copy code
- name: tap-s3-csv
    pip_url: git+<https://github.com/s7clarke10/pipelinewise-tap-s3-csv.git@v3.0.0>
    variant: s7clarke10
  - name: tap-s3-csv--population
    inherit_from: tap-s3-csv
    config:
      bucket: "mybucket"
      start_date: "2000-01-01T00:00:00Z"
      tables:
      - table_name: "population"
        key_properties:
        - "Country"
        search_prefix: "tst/dev/"
        search_pattern: ".*population.*[.]csv"
        delimiter: ','
        datatype_overrides:
          Country: "string"
          Population: "integer"
          GDP: "string"
          Area: "integer"
h
- name: tap-s3-csv pip_url: git+https://github.com/s7clarke10/pipelinewise-tap-s3-csv.git@v3.0.0 variant: s7clarke10 - name: tap-s3-csv--population inherit_from: tap-s3-csv config: bucket: "audiencex-test-files" start_date: "2000-01-01T000000Z" tables: - table_name: "population" key_properties: - "Country" search_prefix: "ipr5zct-piewie-clicks/" search_pattern: ".*population.*[.]csv" delimiter: ',' datatype_overrides: Country: "string" Population: "integer" GDP: "string" Area: "integer" Here I kept everything as you shared and just changed the bucket and search prefix but still the same issue. I am using Windows OS FYI
s
Looking at the error message it seems like a permissions and config issue with Meltano. Here is the results of my config.
Copy code
(.venv) [sclarke@server]$ meltano config tap-s3-csv--population
{
  "aws_access_key_id": "<removed>",
  "aws_secret_access_key": "<removed>",
  "bucket": "mybucket",
  "start_date": "2000-01-01T00:00:00Z",
  "tables": [
    {
      "table_name": "population",
      "key_properties": [
        "Country"
      ],
      "search_prefix": "tst/dev/",
      "search_pattern": ".*population.*[.]csv",
      "delimiter": ",",
      "datatype_overrides": {
        "Country": "string",
        "Population": "integer",
        "GDP": "string",
        "Area": "integer"
      }
    }
  ]
}
I would try a clean meltano install followed by.
Copy code
meltano init
Then
Copy code
meltano add extractor tap-s3-csv
h
Tried this also but not worked