OAK

Customs Import Declaration Datasets

Metadata Downloads
Author(s)
Chaeyoon Jeong / 정채윤Sundong Kim / 김선동Jaewoo Park / 박재우Yeonsoo Choi / 최연수
Type
Conference Paper
Citation
17th annual WCO PICARD Conference
Issued Date
2022-12-07
Abstract
Given the huge volume of cross-border flows, effective and efficient control of trades becomes more crucial in protecting people and society from illicit trades while facilitating legitimate trades. However, limited accessibility of the transaction-level trade datasets hinders the progress of open research, and lots of customs administrations have not benefited from the recent progress in data-based risk management. In this paper, we introduce an import declarations dataset to facilitate the collaboration between the domain experts in customs administrations and data science researchers. The dataset contains 54,000 artificially generated trades with 22 key attributes, and it is synthesized with CTGAN while maintaining correlated features. Synthetic data has several advantages. First, releasing the dataset is free from restrictions that do not allow disclosing the original import data. Second, the fabrication step minimizes the possible identity risk which may exist in trade statistics. Lastly, the published data follow a similar distribution to the source data so that it can be used in various downstream tasks. With the provision of data and its generation process, we open baseline codes for fraud detection tasks, as we empirically show that more advanced algorithms can better detect frauds.
Publisher
세계 관세 기구 (World Customs Organization)
Conference Place
BE
브뤼셀
URI
https://scholar.gist.ac.kr/handle/local/21741
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.