UPDATED on 7/19/2022
Article Type is a top-level category.
Subcategories are:
- Authored [= Writer]
- Curated
- Scientific Report
UPDATED on 7/12/2022
Erich on 7/12/2022:
- Stand-up dev environment (sounds like you’ve made great progress on this over the weekend)
- Assess if adding new categories will negatively affect current category assignments, presuming not:
- Determine and establish new top level categories
- Aviva supplied the following categories:
- Article type – Under Parent: Academic Publishing:
- >>> Authored article
- >>> Authored and Curated article
- >>> Curated article
- >>> Curated and Reported article
- >>> Scientific reported article
- Aviva will create a new category in WP for each.
- Scrape each article to determine which category it belongs to and assign it accordingly
- Find WordPress themes well suited to the content and present them to Aviva and me to evaluate/select
- Implement selected theme in dev
- Install “Schema” plugin to automatically add JSON-LD mappings to Schema.org types
- There are a few different schema.org vocabularies that can work. Search for “blog” on the schema.org site to discover options
- Develop strategy to update production environment…
- We’ll research and discuss the safest pipeline to update production.
- Determine and establish new top level categories
This is the basic outline of our steps. Once complete, we should have a much more usable and discoverable site, improved SEO, etc.
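The step "scrape each article to determine which category it belongs to" could be sketched roughly as follows. This is only an illustration: it assumes each article carries a byline line such as "Curator: ..." or "Reporter and Curator: ...", which the plan above does not state, and the names `ROLE_KEYWORDS`, `detect_roles`, and `assign_article_type` are hypothetical.

```python
import re

# Assumed byline keywords; the real articles may label roles differently.
ROLE_KEYWORDS = {
    "author": "authored",
    "writer": "authored",
    "curator": "curated",
    "reporter": "reported",
}

# Role combinations mapped to the article types supplied by Aviva.
CATEGORY_BY_ROLES = {
    frozenset({"authored"}): "Authored article",
    frozenset({"authored", "curated"}): "Authored and Curated article",
    frozenset({"curated"}): "Curated article",
    frozenset({"curated", "reported"}): "Curated and Reported article",
    frozenset({"reported"}): "Scientific reported article",
}


def detect_roles(text):
    """Collect roles named in short 'Label: value' lines of the article text."""
    roles = set()
    for line in text.splitlines():
        label, sep, _ = line.partition(":")
        # Skip lines without a colon and long sentences that merely contain one.
        if not sep or len(label) > 60:
            continue
        for keyword, role in ROLE_KEYWORDS.items():
            if re.search(rf"\b{keyword}s?\b", label, re.IGNORECASE):
                roles.add(role)
    return frozenset(roles)


def assign_article_type(text):
    """Return the matching article type, or None when no rule applies."""
    return CATEGORY_BY_ROLES.get(detect_roles(text))
```

Articles for which `assign_article_type` returns `None` would need manual review, since the byline heuristic is a guess and not a confirmed site convention.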
UPDATED on 11/7/2021
I wish to see the following two tables:
TABLE 1
Total articles on 11/7/2021: 6,103, VIEWS: 2,062,000
Total Number of Authored articles: X,XXX
Total Number of Curations: Y,YYY
Total Number of Scientific Reports: Z,ZZZ
Total Number of Pages: 418
Total Number of Articles Produced at Conferences: VVVV
TABLE 2
Content Creator Name | Authored Articles, N = | Curations, N = | Scientific Reports, N = | Multiple Authors, N = 2 | Multiple Authors, N = 3 | TOTAL, N = XXX |
Dr. Larry | | | | | | |
Dr. Williams | | | | | | |
Aviva | | | | | | |
Dr. Dror Nir | | | | | | |
Dr. Irina | | | | | | |
Dr. Tilda | | | | | | |
Dr. Ritu | | | | | | |
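Once each article has a content type and a list of creators, the counts for both tables could be tallied along these lines. This is a minimal sketch: the record layout (a dict with `type` and `creators` keys) and the function name `build_tables` are assumptions made for illustration, not an existing data format.

```python
from collections import Counter, defaultdict


def build_tables(articles):
    """Tally TABLE 1 totals and TABLE 2 per-creator rows.

    `articles` is an iterable of dicts with an assumed layout:
    {"type": "Authored" | "Curation" | "Scientific Report",
     "creators": ["Name", ...]}
    """
    totals = Counter()                  # TABLE 1: articles per content type
    per_creator = defaultdict(Counter)  # TABLE 2: one Counter per creator
    for article in articles:
        totals[article["type"]] += 1
        creators = article["creators"]
        for name in creators:
            row = per_creator[name]
            row[article["type"]] += 1
            row["TOTAL"] += 1
            # Track co-authored articles with exactly two or three creators.
            if len(creators) in (2, 3):
                row[f"Multiple Authors N = {len(creators)}"] += 1
    return totals, per_creator
```

A `Counter` returns 0 for any column a creator has no entries in, so empty cells need no special handling when the rows are printed.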
June 15, 2021
Getting Started:
Welcome:
Welcome to the documentation of the LPBI Views Extraction Project! This documentation should help you get up to speed on what our project consists of and how to run it.
The task assigned to us was:
- To get the views for each of the 6,000+ articles posted on LPBI’s Journal over several specified time periods.
- To deliver the output in CSV/Excel format, organized by views per year (if applicable)
- The time spans we had to extract data from were:
- From 2012 to 12/31/2020 (Multiple years)
- From 6/30/2020 to 6/30/2021 (One year)
- From 1/1/2020 to 6/30/2020 (Half of a year)
- From 7/1/2020 to 12/31/2020 (Half of a year)
- From 2012 to 12/31/2021 (Multiple years)
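The time spans listed above can be written down as date ranges and applied to one article's view counts. The sketch below is illustrative only: it assumes daily view counts are already available as a mapping from date to views, that "From 2012" means January 1, 2012, and that both endpoints are inclusive. The names `TIME_SPANS`, `views_in_span`, and `views_per_span` are hypothetical and are not taken from the project code.

```python
from datetime import date

# The five reporting windows. "From 2012" is assumed to start on
# January 1, 2012; both endpoints are treated as inclusive.
TIME_SPANS = {
    "2012 to 12/31/2020": (date(2012, 1, 1), date(2020, 12, 31)),
    "6/30/2020 to 6/30/2021": (date(2020, 6, 30), date(2021, 6, 30)),
    "1/1/2020 to 6/30/2020": (date(2020, 1, 1), date(2020, 6, 30)),
    "7/1/2020 to 12/31/2020": (date(2020, 7, 1), date(2020, 12, 31)),
    "2012 to 12/31/2021": (date(2012, 1, 1), date(2021, 12, 31)),
}


def views_in_span(daily_views, start, end):
    """Sum the views whose date falls within [start, end]."""
    return sum(count for day, count in daily_views.items() if start <= day <= end)


def views_per_span(daily_views):
    """Return the total views for each reporting window, keyed by its label."""
    return {
        label: views_in_span(daily_views, start, end)
        for label, (start, end) in TIME_SPANS.items()
    }
```

Because 6/30/2020 belongs to both the one-year span and the first half-year span, views on that day are counted in each of them under the inclusive-endpoint assumption.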
The project also had some specific requirements for the output, but the main goal was to report the views for each article over a selected time span. It took several different approaches to get the desired output, but we arrived at a working design in which separate classes each contribute one part of the project.
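To make the "views by year" output format concrete, here is a minimal sketch of writing such a CSV with Python's standard `csv` module. The column layout (title, one column per year, total) and the function name `write_views_csv` are assumptions for illustration; the project's actual output columns may differ.

```python
import csv
import io


def write_views_csv(rows, years, out):
    """Write one CSV row per article: title, views per year, total.

    `rows` is an iterable of (title, {year: views}) pairs (assumed layout);
    years missing from an article's mapping are written as 0.
    """
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["Title", *[str(year) for year in years], "Total"])
    for title, views_by_year in rows:
        counts = [views_by_year.get(year, 0) for year in years]
        writer.writerow([title, *counts, sum(counts)])


def example_csv_text():
    """Usage example: render a tiny table into a string."""
    buffer = io.StringIO()
    write_views_csv([("Sample article", {2020: 3, 2021: 4})], [2020, 2021], buffer)
    return buffer.getvalue()
```

Passing an open file object instead of `io.StringIO` writes the same rows to disk, and the resulting file opens directly in Excel.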
This project is vital to LPBI’s operations: the data it produces can support future LPBI work in areas such as Blockchain and View Forecasting/Prediction. It was a very interesting and complex project that we are glad to have built.
As for this documentation, each section on this website explains each method of a class in detail, as well as how the class works as a whole. If you have questions about how a specific line or function works, you can follow the GitHub link to this project to view the code.
Important: To access the GitHub link, first create a GitHub account (you can create one on this page), then send your email and username to Abhisar (abhisar.muz@gmail.com) or Srinivas (srinivassriram06@gmail.com). We will let you know when you have been added, and you will then be able to view the GitHub page.
Happy reading, and we hope you enjoy this package for LPBI!
This project was created by:
Abhisar Anand, Research Assistant I
https://www.linkedin.com/in/abhisar-anand/
Srinivas Sriram, Research Assistant I
https://www.linkedin.com/in/srinivas-profile/
For confirmation that we both performed this task and got recommendations and certificates from our mentor, Dr. Lev-Ari, Ph.D., RN, check out our LinkedIn Profiles.
Legal Use and Rights:
Based on:
From: Aviva Lev-Ari <avivalev-ari@alum.berkeley.edu>
Date: Wednesday, August 18, 2021 at 6:01 PM
To: “Srini Sriram (Westford Academy)” <srinivassriram06@gmail.com>, “Abhisar Anand (Westford Academy)” <abhisar.muz@gmail.com>
Cc: “Dr. Stephen J. Williams” <sjwilliamspa@comcast.net>, “Prof. Marcus W Feldman” <mfeldman@stanford.edu>
Subject: SOP Web STAT | Leaders in Pharmaceutical Business Intelligence (LPBI) Group
- Created during 2021 Summer Internship at LPBI, 6/15 – 8/24/2021
- Tasks are given for performance by Dr. Lev-Ari
- avivalev-ari@alum.berkeley.edu 617-775-0451, https://pharmaceuticalintelligence.com
- The solution to the problem specified by the series of tasks was developed independently by Srinivas Sriram and Abhisar Anand without input from any member of the LPBI team. Thus, it represents an original solution created by us. It includes the idea of how to extract the data and the code to execute the solution. The Tasks formulate a Problem solved by Srinivas Sriram and Abhisar Anand. It represents a creative solution. The solution concept was implemented in code produced by Srinivas Sriram and Abhisar Anand, as authors of the solution and of the code not as its owners.
- The Code was tested on LPBI data of 6,080 articles and +2MM views were extracted, 2012 – 2021.
- We, Srinivas Sriram and Abhisar Anand disclaim ownership of the data on which the idea materialized in Python code and was tested.
- We, Srinivas Sriram and Abhisar Anand devised a solution to the problem posed by Dr. Lev-Ari and we developed the code to solve the problem.
- LPBI has access to the code for recurrent and unlimited use by 2021 Summer Internship conditions by which the software was created during the period mentioned above.
- This Project represents an IT initiative as documented in https://pharmaceuticalintelligence.com/2019-vista/summer-2019-plan-research-associates-tasks/
- The last run in Excel for 6/30/2020 was shared with Srinivas Sriram and Abhisar Anand – code by Alex Crystal, LPBI Intern in 2019.
- In the eventuality of LPBI Exit – this code is part of IP Asset Class IV: Platforms and Composition of Methods, to which the following related statistical modeling belongs as well
- https://pharmaceuticalintelligence.com/vision/pharmaceuticalintelligence-com-journal-projecting-the-annual-rate-of-article-views/
- This statistical method for Prediction of future Article Views, 2021-2025 was implemented in Matlab and the model was designed in-House by the two authors.
- The Article Views data extractions method devised by Srinivas Sriram and Abhisar Anand will be used on 12/31/2021 for 1/1/2022 rerun of the Matlab Programs on the new data, 6/30/2020 to 12/31/2021 – to be treated as actual data to be used for the Prediction of Article Views for 2022-2025
- LPBI views this project and the solution devised as mission-critical to monitoring and using the Website Statistics.
- We discussed with Dr. Lev-Ari the following:
- The desire to bring the software to shrink-wrap status and approach WordPress.com for their potential adoption and offering it to all other websites they are hosting. That activity needs to be pursued jointly with Dr. Lev-Ari, per 1, 2, 4, 5, 6, 7, 8, 9, above
- If WordPress.com will purchase the shrink-wrap software a sharing agreement will be discussed. A fair arrangement would be 50% each party: Srinivas, Abhisar, and LPBI, data owner, opportunity to work on this problem, evidence of PAST projects on Article Views data and software for Views Extraction.
- The desire to bring the software to the Competitions and software Hackathon and External Professional reviewers are fully supported by Dr. Lev-Ari. These activities need to be pursued jointly with Dr. Lev-Ari, per 1, 2, 4, 5, 6, 7, 8, 9, above
- Dr. Lev-Ari will serve as a Mentor and Letter of Recommendation Writer (LOR) for Srinivas Sriram and Abhisar Anand in their academic future.
- Dr. Lev-Ari will serve as an Advocate and Promoter for all activities involving Srinivas Sriram and Abhisar Anand related to
- Current and Future LPBI involvement
- Academic pursuits
- Competition pursuits
- Relations with WordPress.com
- External Reviewers of all code written for LPBI by Srinivas Sriram and Abhisar Anand
Task Description given by Dr. Lev-Ari
1st Project: Data Science Project
STARTS
6/15/2021 – 8/15/2021
Task 1: Write CODE in Python HOW TO GENERATE automatically a download of all Journal articles (N=6,050)
by Views by Year since date of Publication
2012-2021: Journal articles by Views. Journal has 2MM views and 6,050 articles
We have a Run 2012 till 6/30/2020
We need one for 2012 till 12/31/2020
We need one 6/30/2020 till 6/30/2021
We need one 1/1/2020 – 6/30/2020
We need one 7/1/2020 till 12/31/2020
We need one run performed on 12/31/2021 since 2012
Task 2: Create Documentation
We need to have the Python code documented so another INTERN will be able to take over that TASK and use your code to run the data.
Task 3: Run the Data
This page has the following sub pages.