Methodology & Data Sources
Primary Data Sources
PlainPatent combines federal patent datasets and the standards bodies that govern them:
- USPTO PatentsView — research platform built in partnership with USPTO providing cleaned, disambiguated bulk data exports from the patent grant database. We use the 2015–2025 grants dataset (3M+ patents). Licensed CC BY 4.0 International.
- USPTO Patent Public Search — official USPTO public search interface for patents and applications, used as canonical reference for patent number formats and grant status.
- USPTO Open Data Portal — the agency's open data initiative including bulk applications, examiner data, and PAIR (Patent Application Information Retrieval) data.
- MPEP — Manual of Patent Examining Procedure — the canonical USPTO manual referenced for examination process, claim interpretation, and patent classifications used throughout site narrative.
- Cooperative Patent Classification (CPC) — joint USPTO/EPO classification scheme; the canonical authority for technology-domain framing on this site.
- 35 U.S.C. — US Patent Statute — federal patent law (Title 35) referenced for substantive patent-law framing including utility (§101), novelty (§102), nonobviousness (§103), and disclosure (§112) requirements.
What We Track
- Company (Assignee) Rankings: Patent counts by organization, with year-over-year trend analysis and technology diversity scores.
- CPC Technology Classifications: Cooperative Patent Classification codes identifying the technical domain of each patent. CPC is a hierarchical system developed jointly by the USPTO and European Patent Office.
- Patent Details: Filing dates, grant dates, inventors, assignees, and forward citation counts.
- Technology Trends: Annual patent volume by CPC class, used to identify growing and declining technology areas.
Processing Pipeline
We download USPTO PatentsView bulk data files (patent grants, assignees, CPC classifications, and citation data), process them through our ETL pipeline, and compute aggregated statistics:
- Assignee disambiguation is handled by PatentsView — we use their cleaned assignee IDs rather than performing our own entity resolution
- Patent grant counts reflect USPTO records without modification
- Technology diversity scores count the number of distinct CPC subgroups (4-character) in an assignee's patent portfolio
- Trend analysis computes year-over-year growth rates in patent volumes by company and technology class
- Innovation scores are computed from a composite of patent volume, technology diversity, citation impact, and temporal consistency of filing activity
Our pipeline processes each patent record individually, preserving the original USPTO data while computing derived metrics at the company and technology-class levels. No patent data is fabricated, interpolated, or editorially modified. All base counts and dates come directly from the PatentsView dataset.
Innovation Score Methodology
PlainPatent assigns each company an innovation score based on four factors: total patent volume relative to the full dataset, breadth of CPC technology classes represented in the portfolio, forward citation counts indicating downstream influence, and the consistency of patenting activity across the 2015–2025 window. The score is a relative ranking tool — it does not measure commercial success, revenue impact, or patent quality in an absolute sense. Companies with narrow but highly cited portfolios may score differently than those with broad but lightly cited ones.
Update Schedule
USPTO PatentsView releases updated bulk data quarterly. We update our database when new data becomes available, typically within 30 days of each PatentsView release. Between updates, newly granted patents will not appear in our database until the next refresh cycle. Historical data remains stable across updates — PatentsView does not retroactively remove records, though assignee disambiguation may be refined.
Limitations
- Patent counts reflect granted patents, not applications. Many applications are pending for 2–4 years before grant or rejection.
- A company's patent count does not directly measure innovation quality or commercial value. Some high-value inventions generate one patent; some low-value ones generate dozens.
- PatentsView assignee disambiguation is not perfect. Some patents may be misattributed to company subsidiaries or predecessor entities.
- Coverage is US patents only. Companies may have significant international patent portfolios not reflected here.
- Forward citation counts are cumulative and favor older patents — recently granted patents have had less time to accumulate citations.
- CPC classification is assigned by USPTO examiners and may not perfectly capture the commercial application of a technology.
Not Affiliated
PlainPatent is not affiliated with the United States Patent and Trademark Office, PatentsView, or any government agency. We are an independent data portal presenting publicly available patent data in a more accessible format.