automatic conversion of a full product name into a "searchable" product name - data-mining

given product names which are retrieved from an external source, I am trying to search for these products on social media. The process should be automated. I am facing a problem that the full names of products are not used on social media.
E.g. Samsung Galaxy S3 i9300 16GB is the full name which is retrieved, but noone uses it. Everyone uses a shorten version like Samsung Galaxy S3 or Galaxy S3.
Or Casio G-Shock GA-110NM-2 Sports Stylish Watch - Blue is rather used as G-Shock GA-110NM-2.
Would you have an idea how to automatise or reasonably semi-automatise this process?
Thank you very much,
Adam

Related

Amazon Selling-Partner-API - How to update price and Quantity of the product

We wants to update the products attributes like quantity, price etc using the Selling Partner API.
We are referring following documents for the same.
https://github.com/amzn/selling-partner-api-docs/blob/main/guides/en-US/developer-guide/SellingPartnerApiDeveloperGuide.md#what-is-the-selling-partner-api
Which API we can use from the following list to update price and quantity
i.e from API list https://github.com/amzn/selling-partner-api-docs/tree/main/references
https://github.com/amzn/selling-partner-api-docs/tree/main/references
Selling Partner API for Pricing - In document Amazon have not mentioned about POST API
i.e to API to update price.
https://github.com/amzn/selling-partner-api-docs/blob/main/references/product-pricing-api/productPricingV0.md
Most likely you will have to use the feed api.
Create feed will require the proper type. If your intent is to only update price and quantities of existing listings, you can use the type POST_FLAT_FILE_PRICEANDQUANTITYONLY_UPDATE_DATA
The body of the request should be the equivalent of a manual price and quantity file submitted though amazon portal.
So it's like (please check yourself for more precise details):
sku | price | quantity
ITEM-SKU | itemprice | itemqty
with tabs in place of "|"
As i am just starting to develop this aspect (up to now i managed to implement order fulfillment tracking pushes and inventory download) this could very well be incorrect. Will update once i get deeper in the implementation of this
If you are totally new to the feed api take a look at https://developer-docs.amazon.com/sp-api/docs/feeds-api-v2021-06-30-reference
The process is not that straight forward but it works in the end.
Hope this helps you
Edit: i tested a simple upload that changed one item price and quantity ad it worked fine. The actual implementation is not hard as long as the prerequisite knowledge / details are clear:
general SP-api knowledg about token, roles, signing etc.
feed submission workflow - explained here: https://developer-docs.amazon.com/sp-api/docs/feeds-api-v2021-06-30-use-case-guide

Using google cloud for image classification, cropping and OCR

Please allow me to ask a rather newbie question. So far, I have been using local tools like imagemagick or GOCR to perform the job, but that is rather old-fashioned, and I am urged to "move to google cloud AI".
The setup
I have a (training) data set of various documents (as JPG and PDF) of different kinds, and by certain features (like prevailing color, repetitive layout) I intend to classify them, e.g. as invoice type 1, invoice type 2, not an invoice. In a 2nd step, I would like to OCR certain predefined areas of each document and extract e.g. the address of the company sending the invoice and the date.
The architecture I am envisioning
In a modern platform as a service (pass), I have already set up an UI where I can upload new files. These are then locally stored in a directory with filenames (or in a MongoDB). Meta info like upload timestamp, user, original file name is stored in a DB.
The newly uploaded file should should then be submitted to google cloud which should do the classification step, and deliver back the label to be saved in the database.
The document pages should be auto-cropped, i.e. black or white margins are removed, most probably with google cloud as well. The parameters of the crop should be persisted in the DB.
In case it is e.g. an invoice, OCR should be performed (again by google cloud) for certain regions of the documents, e.g. a bounding box of spanning from the mid of the page to the right margin in the upper 10% of the cropped page. The results of the OCR should be again persisted locally.
The problem
I seem to be missing the correct search term to figure out how to do it with google cloud. Is there an google-API (e.g. REST), I can use to upload and which gives me back the results of steps 2 to 4?
I think that your best option here is to use Document AI (REST API and Libraries).
Using Document AI, you can:
Convert images to text
Classify documents
Analyze and extract entities
Additionally, for your use case, we have a new Document AI feature that is still in preview and has limited access which is the Invoice parser.
Invoice parser is similar to Form parser but for invoices instead of forms. Check out the Invoice parser page and you will see what I mean by preview and limited access.
AFIK, there isn't any GCP tool for image edition.

Amazon product api get stock availability (In Stock, out of Stock, etc)

I am using product advertising api by Amazon. Its a little lacking to be honest, since there is no way of getting the product name. You can get the title, but that contains the offer "Amazon Kindle 3G, with free wifi..." instead of simply Amazon Kindle 3G.
More importantly, there seems to be no way of getting the stock information. I just need to know if an item is in stock or not. That's all, there seems to be no way of doing that yet. Ama I missing something. I am using the ItemLookup Api. I get some details by using the Large Response Group but no stock information
The various 'offers' response groups list 'offers' from Amazon and 3rd party merchants; this data includes availability details as well as pricing for the (just) 1 offer it will tell you about. You can supply a MerchantId parameter forcing that one offer to be from Amazon.
This API has been somewhat limited (i.e. to just 1 offer) since 2011; for fuller info try the MWS API instead.
For a more literal product name, you could try obtaining the UPC or EAN from the ItemAttributes response group, and then looking that up in a non-Amazon UPC database. No idea how well that would work in practice.
Although I am using amazon product api with a python wrapper..there is a availability parameter - checkout the amazon doc http://docs.aws.amazon.com/AWSECommerceService/latest/DG/CHAP_ReturningPriceAndAvailabilityInformation-itemsearch.html

Using Amazon services to query products on the amazon website

I'm creating a application where the user will type in the name of a video game and a query will be sent to a servlet. I want this query to search the amazon product database and if the game is found i want to grab the information such as the name, publisher, platform genre etc and add this information to my database. Just like price grabber does. There is an example below.
http://video-games.pricegrabber.co.uk/nintendo-ds-games/m/25813985/details/st=product_tab/
Can this be done and if so what will i need to know and learn to do this.
I believe you should be able to use the Amazon Product Advertising API to query for this kind of product information. However the current Terms of Use appear to restrict you from storing the data into your database for more than 24 hours.

Locating Businesses by Latitude and Longitude (API?/Webservice?)

I am looking for a free database (csv, tab delimited, xml, etc) or even a web service or API which will allow me to query for business listings within a certain distance of a latitude and longitude.
A global scope would be perfect, but I would settle for Canada/US information.
Does anyone know of anything like this?
Google AJAX search API offers this kind of service (local search) if you use the Google Maps API, see http://code.google.com/intl/fr/apis/ajaxsearch/documentation/
Apparently Yahoo! does too (see http://developer.yahoo.com/search/local/localSearch.html), if you do not plan a commercial use.
If you want to do this right, here is what you need:
Database of all locations, with long/lat coords
Database of all businesses
Address --> long/lat conversion system, if your business database is not already geocoded
I can't help you out with the business database, but here is a good resource for the "all locations" database:
http://www.geobytes.com/freeservices.htm
(Scroll down to the bottom of the page - you're looking for "GeoWorldMap".)
It contains a set of text files that are designed to be imported into a relational database. It's the most complete (free) resource that I've been able to find for this sort of task.
The Google Maps Places API provides exactly what you're looking for: https://developers.google.com/places/documentation/search