ABSTRACT

With the rapid development of big data technology, much 'sleeping' data can now be put to use, but the source of the data remains the key issue. Previous methods of obtaining data can no longer meet demand. This article uses a Python web crawler to collect down jacket information (shell material, structure type, fill material, process information, and style information) from Alibaba International Station and stores it in a MongoDB database as a data source for apparel information analysis.

Keywords: Data mining; Python web crawler; Clothing information analysis; Down jacket