All projects in this language were done outside of my studies. I had the chance to use an SQLite database, parse commands using the console and predict the seasonality of sales. Additionally, I created my model for customer segmentation that compares two groups and indicates which one should be targeted. In this case, the requests are sent in JSON format by the Postman. Currently, I’m working on the NLP processing project “Identification of drug interactions from the summary of product characteristics”.
Some of my projects:
Project | Description |
---|---|
Identification of drug interactions from the summary of product characteristics | The website (in polish) allows you to process the SmPC (Summary of Product Characteristics) to find interactions between the substances. Every medicine authorized in Poland has the SmPC, including the section Interactions with other medicinal products and other forms of interaction . Based on this passage, I have tried to extract the names of substances that interact with the product. The list of substances is taken from the Register of Medicinal Products, which contains links to the SmPC of the medicine in question and the names of the active substances in foreign (English/Latin). The whole site was deployed using the Streamlit library. NLP processing is done using the spacy library and thefuzz for comparing names of medicinal substances. |
Prediction model of sales in alcohol stores by using the Prophet | Build a prediction model by the prophet to indicate if the credit could be granted to some stores. For the stores, there is information about the revenue of the alcohol sales. The clustering has been implemented to find stores with similar attributes. |
Forecasting of sales | The aim was to create a model prediction of the sales for the next three weeks. Currently, the sales forecast is set 3 weeks ahead based on last week’s sales. The Weighted Absolute Percent Error (WAPE) is used for comparison purposes. The whole dataset contains 3 CSV files. prophet pandas |
Predicting profitable customer segments | Models for customer segmentation that compares two groups and indicates which one should be targeted were created. Based on the approaches, five different models have been made. For one of the models (GradientBoostingClassifier) to predict if the campaign should be launched for the group (one of them, or none of them)), the requests are sent in JSON format by the Postman. GradientBoostingClassifier LogisticRegression KNN modeling pandas numpy |
Stock price: jumping out and in of dividend stocks around ex dividend dates | The payment of dividends by a company is an attractive morsel for a shareholder. Such a company is better perceived because of its attractiveness and its willingness to share its profits with investors. But is it always profitable to own shares when dividends are paid? In this note, I will try to answer this question. For the analysis, I have chosen companies that regularly pay dividends on the Polish stock exchange. pandas BeautifulSoup requests |