How to get the gene ID

There is a brute force method for this. You could upload the fasta sequence in tblastn and keep a filter of 100% sequence cover and blast. Usually the first hit should give you the Genbank/Refsec ID for your protein sequence.

The next one will require some scripting, but if you can run this in a loop, it will be faster than the previous method. Here, you can search for the protein ID in NCBI protein, you will get the following:

LOCUS QACA_STAAU 514 aa linear BCT
03-MAY-2023 DEFINITION RecName: Full=Antiseptic resistance protein.
ACCESSION P0A0J9 VERSION P0A0J9.1 DBSOURCE UniProtKB: locus
QACA_STAAU, accession P0A0J9;
class: standard.
extra accessions:P23215
plasmid:VRSAp,pSK1
created: Mar 1, 2005.
sequence updated: Mar 1, 2005.
annotation updated: May 3, 2023.
xrefs: X56628.1, CAA39963.1, S12394, WP_000622776.1,
YP_003813123.1, YP_536864.1
xrefs (non-sequence databases): AlphaFoldDB:P0A0J9, SMR:P0A0J9,
TCDB:2.A.1.3.4, GeneID:66840931, OMA:CFQTIFY, GO:0005886,
GO:0022857, GO:0046677, CDD:cd17321, Gene3D:1.20.1250.20,
InterPro:IPR011701, InterPro:IPR020846, InterPro:IPR036259,
InterPro:IPR005829, PANTHER:PTHR42718:SF42, PANTHER:PTHR42718,
Pfam:PF07690, SUPFAM:SSF103473, PROSITE:PS50850, PROSITE:PS00216 KEYWORDS Antibiotic resistance; Cell membrane;
Membrane; Plasmid;
Transmembrane; Transmembrane helix; Transport.

You can see the GeneID there, 66840931. If this number is searched in NCBI gene, you will get an output that looks like this:

You can donwload the fasta from here.
Hope this helps.

Read more here: Source link